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(57) Abstract 

The present invention relates to expression vectors 
comprising nucleic add sequences which encode an affin- 
ity ligand (e.g., an enTyme, epitope) and a modification 
recognition sequence. The vectors further comprise at 
least one restricdon site for the msertion of a nucleic add 
sequence capable of encoding a selected polypeptide. On 
expression, the resulting construct codes for a fusion pro- 
tern comprising an affinity ligand, the selected polypep- 
tide and a modification recognition sequence. The fusion 
protein may be isolated by virtue of the affinity ligand 
and then modified. The expression vectors may further 
comprise a nucleotide sequence encoding a cleavable link- 
er, such as a thrombin or factor Xa deavage sequence. 
The invention further relates to expression vectors con- 
taining a nucleotide sequence encoding a gene for a se- 
lected polypeptide and capable of directing the expression 
of the selected polypeptide as a fusion protein. In addi- 
tion, methods of produdng a modified fusion protein are 
disclosed. Modified or labeled fusion proteins of the pres- 
ent invention are useful in a variety of therapeutic, diag- 
nostic and research applications. 
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PLASMIDS FOR THE RAPID PREPARATION OF 
MODIFIED PROTEINS 

Description 



Background 

05 Current methods for the production of labeled 

proteins include iodination, biotinylation, or the in 
vivo labeling of protein. in vivo labeling methods 
require large amounts of radioactivity are required, 
and the protein of interest must be separated from 
10 other labeled cellular proteins. The biotinylation 
and iodination procedures also require a purification 
step for each individual protein and, in some cases, 
reaction conditions which can cause inactivat-ion of 
the protein. In addition, using these methods, 
15 modification of a protein may occur at a variety of 
sites, leading to distortions in structure or 
biological activity. 

For example, iodination protocols relying on 
chloraraine T or iodogen result in modifications at 
20 tyrosine residues and some histidine residues. 

Over-substitution and oxidation damage may result. 
Labeling procedures which use the Bolton-Hunter 
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reagent: result in the modification of free amino 
groups on lysine residues. In some proteins this 
particular modification may also have a deleterious 
effect on structure or activity. Although the 

05 lactoperoxidase method for iodination employs gentler 
conditions, this method also leads to modification of 
tyrosine and histidine residues, with the potential 
for structural distortion and loss of activity. 
Similarly^ biotinylation protocols are 

^0 frequently perfozrmed using a succinimide ester of 
biotin. The biotin is coupled to the protein through 
free amino groups, typically on lysine residues. 
Again, modification at one or more positions may 
alter structure and/or function of the protein. In 

15 addition, extensive dialysis is needed to remove 
uncoupled biotin, which may be deleterious to the 
protein.. 

Summary of the Invention 

The present invention relates to expression 
20 vectors comprising nucleic acid sequences which 

encode an affinity ligand (e.g., an enzyme, epitope) 
and a modification recognition sequence The vectors 
further comprise at least one restriction site for 
the insertion of a nucleic acid sequence capable of 
25 encoding a selected polypeptide in frame with the 
affinity ligand and modification sequence. On 
expression, the resulting construct codes for a 
fusion protein comprising an affinity ligand, the 
selected polypeptide and a modification recognition 
30 sequence. 
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The fusion protein may be isolated by virtue of the 
affinity ligand and then modified. 

The expression vectors may further comprise a 
nucleotide sequence encoding a cleavable linker, such 
05 as a thrombin or factor Xa cleavage sequence. The 
sequence encoding the cleavable linker is located 
between the affinity ligand and the restriction site 
for insertion of the sequences encoding the selected 
polypeptide. In this location, the linker may be 
10 cleaved to release the protein of interest following 
modification. The invention further relates to 
expression vectors containing a nucleotide sequence 
encoding a gene for a selected polypeptide and 
capable of directing the expression of the selected 
15 polypeptide as a fusion protein. 

The PGEX-2TK expression vector is one embodiment 
of the present invention. This vector encodes a 
protein comprising, from amino to carboxyl terminus, 
glutathione-s-transf erase (GST) as an affinity 
20 ligand, the thrombin cleavage site as a cleavable 
linker, and a phosphorylation recognition site for 
the cAMP-dependent protein kinase as a modification 
recognition sequence. A multiple cloning site 
comprising three restriction sites is located 
downstream of the sequence encoding the 
phosphorylation site. Thus, a nucleic acid sequence 
encoding a selected polypeptide may be inserted into 
the vector using one or more of these sites. The 
selected polypeptide is expressed as a GTK- fusion 
protein (G, GST; T, thrombin; K, kinase) . 



30 
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In addition r methods of producing a modified 
fusion protein are disclosed. Upon expression in a 
suitable host cell, the protein of interest is 
produced as a fusion protein • The fusion protein can 
05 be captured on a suitable affinity matrix by virtue 
of an affinity ligand, which interacts reversibly 
with the matrix. Modification of the fusion protein 
can be carried out while the protein is attached to 
the matrix. Subsequently , the modified fusion 
10 protein may be isolated for use by releasing the 
fusion protein from the affinity matrix with a 
suitable agent. In the case where a cleavable linker 
is present^ the fusion protein may be cleaved in 
vitro to free the modified polypeptide portion, and 
15 the affinity ligand portion or any uncleaved product 
can be removed by adsorption on the appropriate 
affinity matrix. Alternatively, the modified 
polypeptide portion can be released from the affinity 
ligand portion by cleaving the modified fusion 
^0 protein at the cleavable linker while still bound to 
the column. 

Modified proteins of the present invention are 
useful in a variety of applications. Labeled (e.g., 
radiolabeled) proteins may be used as molecular 

25 

probes. For example, antibodies can be labeled for 
therapeutic, diagnostic (e.g., imaging) or research 
purposes. Proteins may be labeled and used as 
reagents to quantitate or identify an interacting 
protein, such as a receptor. 
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Brief Descri ption of the DrawinaR 

Figure i shows a portion of the nucleotide 
sequence around the kinase recognition site of 
expression vector pGEX-2TK. The portion of the 
OS PGEX-2TK sequence introduced by the synthetic duplex 
is indicated by italics. The sequences which encode 
the linker cleavable by thrombin (Leu-Val-Pro-Arg- 
Gly-ser), the downstream kinase recognition site 
(Arg-Arg-Ala-Ser-Val) and multiple cloning site with 
BamHI, Smal and EcoRI sites, are shown. The arrow 
indicates the point of thrombin cleavage. 

Figure 2 is an illustration of the structure of 
expression vector pAR(ARI) 59/60. The general 
structure of the plasmid is shown at top. bla, 
indicates the /3- lactamase gene which confers 
ampicillin resistance; ori, indicates the origin of 
replication. in the center panel, the general 
structure of FEK-fusion proteins is illustrated. The 
FLAG peptide portion, comprising the FLAG epitope and 
enterokinase cleavable linker, is indicated. "hmk" 
indicates the location of .the phophorylation site. 
"Protein" indicates the location of the selected 
polypeptide. The lower panel shows a more detailed 
illustration of the N-terminal region of the vector. 
The peptide sequence shown (Met-Asp-Tyr-Lys-Asp-Asp- 
-Asp-Asp-Lys-Ala-Arg-Arg-Ala-Ser-Val-Glu-Phe-) in the 
detail is a contiguous sequence. The extent of the 
FLAG peptide, the enterokinase cleavage site, and HMK 
recognition (phosphorylation site) are indicated. 
Figure 3 is a bar graph showing the effect on 
P incorporation in vitro of an HMK sequence on 
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different fusion proteins, comprising portions of the 
retinoblastoma susceptibitliy gene product, RB. 
Proteins vere phosphorylated in vitro using 
cAMP-dependent protein kinase. The hatched bars 
05 indicate that the fusion protein (GST-RB379-792 or 
GST-RB379-792fpm706) was expressed from pGEX-2TK and 
contained an HMK sequence, while the solid bars 
indicate that the fusion protein (GST-RB379-792 or 
GST^RB379--792?pm70e) was expressed from pGEX-2T and 
10 did not contain an HMK sequence. 

Unless indicated otherwise, the orientation of 
particular amino acid sequences is such that the 
amino end is on the left and the carboxyl end is on 
the right. 

15 Detailed Description of the Invention 

The present invention relates to expression 
vectors comprising nucleic acid secjuences which 
encode an affinity ligand and a modification 
recognition sequence. The vectors further comprise 

20 at least one restriction site for the insertion of a 
nucleic acid sequence capable of encoding a selected 
polypeptide. The expression vectors may further 
comprise a nucleotide sequence encoding a cleavable 
linker sequence* These vectors are referred to as 

25 parent vectors. 

The present invention further relates to 
expression vectors which are derived from the parent 
vectors described above, by the insertion of a 
nucleic acid sequence capable of encoding a selected 
polypeptide (e.g. , a natural or synthetic cDNA or 



mSDOClD: <WO__9303157A1J_> 



SUBSrriTUTE SHEET 



wo 93/03157 



PCr/US92/06187 



-7- 



genomic DNA which encodes the selected polypeptide 
and is capable of being expressed in the appropriate 
host cell) into the parent vector. The parent 
vector can be cleaved at one or more restriction 
05 sites in order to insert sequences encoding a 

selected polypeptide using known techniques. Linkers 
or other sequences can be used to facilitate 
insertion. Insertion at an appropriate restriction 
site or site in the parent vector results in the in 
10 frame fusion of the sequences of the selected 
polypeptide with the amino acid sequences for an 
affinity ligand, modification recognition sequence 
and, if present, the optional cleavable linker 
encoded by the parent vector, on expression, the 
resulting fusion gene encoded by the vector codes for 
a fusion protein comprising an affinity ligand, a 
modification recognition sequence, selected 
polypeptide, and optionally, a cleavable linker. The 
20 restriction site or sites selected 

for insertion in the vector will determine the 
location of the selected polypeptide relative to the 
affinity ligand, modification recognition sequence 
and optional cleavable linker in the encoded fusion 
protein. The selected polypeptide element of the 
fusion protein comprises at least one peptide, 
polypeptide or protein of interest. 

The fusion protein comprising an affinity 
ligand, a modification recognition sequence, an 
optional cleavable linker and a selected polypeptide 
forms a contiguous polypeptide chain. The order of 
these components in the fusion gene and protein can 
vary. The location of these components or additional 
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components in the fusion gene and protein is 
determined by the order in which the nucleic acid 
sequences encoding the components are linked in the 
vector or the parent vector. For example, fusion 
05 proteins of the following structure can be made, 
where A indicates the affinity ligand, M indicates 
the modification recognition sequence, C indicates 
the optional cleavable linker, and X indicates the 
polypeptide of interest,, and the N-terminal portion 
iO of the fusion is on the left: 

A-(C)-M-X 

A-M-(C) -X 

X-(C)-M-A 

X-M-(C)-A 
^5 K-A-(C)-X 

M[-(C)-A-X 

Other permutations of the components are possible. 
However, the cleavable linker is preferably located 
between the affinity ligand and selected polypeptide 

^0 portions of the fusion protein. In addition, intro- 
duction of a selected polypeptide in frame is 
simplified by a terminal location for the polypeptide 
(X) . This location can also minimize the effect of 
fusion on the biological activity (e.g., binding 

^ activity, antigenicity, enzymatic activity) of the 
selected polypeptide. 

One or more components can be duplicated (e.g., 
X-M-C-A-X) . For example, multiple modification sites 
can be incorporated to increase the intensity of 
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labeling. These sites may be contiguous or 
noncontiguous in the encoded fusion protein. 
Furthermore, additional sequences (S) can be present 
in the fusion protein. For instance, a sequence 
encoding a signal peptide or leader peptide may be 
incorporated into the vector, and the signal peptide 
will be produced as part of the fusion protein. In 
the case of a signal peptide, the additional sequence 
is preferably incorporated at the N-terminus (e.g., 
S-X-M-C-A) - However, for other sequences, an 
internal or C-terminal location in the fusion protein 
may be desired. 

Note that on expression in a suitable host cell, 
the parent vector may also produce the affinity 
ligand, modification sequence, and optional cleavable 
linker sequences as a fusion protein. This product 
may contain additional sequences encoded by the 
vector, depending on the location of a promoter or 
termination signals relative to the coding sequences 
for these elements. In some cases, restriction sites 
for insertion of sequences encoding a selected 
polypeptide may disrupt the reading frame of 
components encoded by the parent vector. Insertion 
of sequences encoding a selected polypeptide will 
restore the reading frame. 

The expression vectors of the present invention 
can be designed for use in a variety of host cells, 
including bacterial host cells such as E. coli and 
eukaryotic host cells (e.g., yeast cells and 
mammalian cells). Thus, fusion proteins can be 
glycosylated. Like other expression vectors, in the 
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expression vectors of the present invention, a 
promoter is provided for expression of the fusion 
protein in a suitable host cell. Suitable promoters 
can be constitutive or inducible. In the vectors, 
the promoter is operably linked to nucleic acid 
sequences encoding the fusion protein and is capable 
of directing the expression of the corresponding 
polypeptide. A variety of suitable promoters for 
procaryotic (e.g., lac promoter, tac promoter) and 
exiJcaryotic hosts (e.g., yeast alcohol dehydrogenase 
(ADHl) , SV40) are available. 

In addition,, the expression vectors typically 
comprise a selectable marker for selection of host 
cells carrying the plasmid and an origin of 
replication, in the case of a replicabie expression 
vector. Genes encoding products which confer 
antibiotic resistance are common selectable markers 
and may be used in prokaryotic (e.g.^ ^-Iact2unase for 
ampicillin resistance, tetracycline resistance) and 
eukaryotic cells (e.g., G418) . Genes encoding the 
gene product of auxotrophic markers (e.g., LEU2 and 
URA3) are commonly used as selectable markers in 
yeast ♦ Use of viral or phage vectors ,^ and vectors 
which are capable of integrating into the genome of 
the host cell, such as retroviral vectors, are also 
contemplated^ The present invention also relates to 
cells carrying these types of expression vectors 
(e.g., transformed cells). 

The incorporation of one or more modification 
recognition sequences into the fusion protein allows 
the production of modified fusion proteins. As shown 
in the Examples, incorporation of a detectable label 
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(e.g., radioactive, fluorescent) at the modification 
site provides a convenient way to label the fusion 
protein. Thus, the invention further relates to the 
fusion proteins and modified (e.g. , labeled) fusion 
05 proteins produced from expression vectors of the 
present invention, it is also possible to produce 
fusion proteins of the present invention using 
methods of peptide synthesis, or in vitro 
transcription/translation procedures. Modification 
10 of synthetic fusion proteins is also possible. 

The affinity ligand present in the fusion 
proteins of the present invention is a polypeptide 
encoding an affinity ligand. The affinity ligand 
(i.e., an affinity ligand or portion thereof) is a 
i5 member of a specific binding pair and is capable of 
binding to a specific binding partner. Preferably, 
the binding of the affinity ligand to its specific 
binding partner is reversible, allowing recovery of 
fusion proteins comprising the affinity ligand where 
20 desired. Examples of affinity ligands include, but 
are not necessarily limited to, antibodies or 
portions thereof, antigens (e.g., influenza 
hemagglutinin) or epitopes (e.g., FLAG epitope), 
enzymes (e.g., glutathione-s-transf erase (GST), 
" ^-galactosidase ( lacZ ) , the trpE product) , hormones, 
growth factors or other proteins capable binding to a 
specific binding partner (e.g., maltose binding 
protein, histidine hexamer (a heavy metal binding 
element) , and protein A) . A specific affinity ligand 
30 is an affinity ligand comprising that protein or 
peptide or a portion thereof. Thus, a 
glutathione-S-transf erase affinity ligand is an 
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affinity ligand comprising glutathione-S-transf erase 
or a portion thereof, which is capable of binding the 
specific binding partner (e.g., a substrate). The 
affinity ligand portion of the fusion protein is 

05 useful for a variety of purposes, such as providing a 
moiety for attachment to a support or to facilitate 
purification or identification. 

For example, the fusion proteins can be 
captured on an appropriate affinity matrix. An 

10 affinity matrix is a solid support to which is 

attached (preferably covalently) a specific binding 
partner. Fusion proteins of the present invention, 
comprising an affinity ligand, can bind to a specific 
binding partner via the affinity ligand part of the 

15 fusion protein. In the case of an affinity ligand 
which is an antigen, an antibody or portion thereof 
can be used as a specific binding partner in an 
affinity matrix. Alternatively, an antigen or hapten 
may be incorporated into an affinity matrix, and used 

20 to capture fusion proteins comprising an antibody as 
an affinity ligand. A number of affinity 
ligand/affinity matrix pairs are available. 
Glutathione-S-transferase fusion proteins may be 
captured on immobilized glutathione as an affinity 

25 matrix, with glutathione as the specific binding 
partner. For example, glutathione sepharose 
(Pharmacia) or glutathione agarose beads (Sigma 
Chemical Corp.) can be used. Fusion proteins 
comprising a protein A, maltose binding protein, FLAG 
peptide, or a histidine hexamer affinity ligand can 
be captured on IgG Sepharose 6FF (Pharmacia) , any lose 
resin (ITew England Biolabs) , anti-FLAG Ml antibody or 
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anti-FLAG M2 antibody affinity resins (International 
Biotechnologies, Inc.), or a Ni^"^ affinity resin (NTA 
resin, Qiagen) , respectively. In the foregoing, IgG, 
amylose, anti-FLAG antibodies, or a heavy metal are, 
05 respectively, the specific binding partners. 

When desired, a fusion protein or modified (e.g. 
labeled) fusion protein bound to an affinity matrix 
can be released by contacting the affinity matrix 
with bound fusion protein thereto with a suitable 
10 elution buffer comprising one or more release 

components. The release component or components can 
be molecules which compete with the fusion protein 
for binding to the affinity matrix (e.g., hapten, 
free peptide epitopes, substrate or substrate 
analogs) , or which can disrupt binding of the 
affinity ligand to the specific binding partner. For 
example, an elution buffer (a buffered solution) 
suitable for releasing fusion proteins comprising a 
glutathione-s-transf erase affinity ligand can be 
formulated comprising reduced glutathione as a 
release component. In one embodiment, an elution 
buffer comprising reduced glutathione, Tris and NaCl 
is used. 

Alternatively, the elution buffer may comprise a 
buffered solution which lacks a specific component 
required for binding. For example, a fusion protein 
with an ompA signal sequence followed by a FLAG 
affinity ligand at the amino terminus can be 
expressed in E. coli. Specific removal of the ompA 
signal sequence upon secretion into the periplasmic 
space results in a fusion protein with an 
N-terminally located FLAG affinity ligand, which is 
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capable of binding to the anti-FIAG Ml antibody in 
tlie presence of calcixim* This type of fusion protein 
can be eluted from an anti-FLAG Ml antibody affinity 
matrix using an elution buffer which lacks calcium, 

05 a specific component required for binding.. 

The expression vectors of the present invention 
can further comprise a sec[uence encoding a cleavable 
linker. A nucleotide sequence capable of encoding 
such a cleavage site can be incorporated into an 

^0 expression vector using knovm techniques (e.g», 
recombinant ONA and/ or mutagenesis) . A cleavable 
linker is a peptide or polypeptide capable of being 
cleaved by a site specific protease. For example, a 
thrombin cleavage (e.g., Leu-Val-Pro-Arg-Gly-Ser) , 

^5 factor Xa cleavage site (e.g., Ile-Glu-Gly-Arg) or 
enter okinase cleavage site (e.g. , Asp-Asp- 
Asp-Asp-Irys) may be used as a cleavable linker. The 
appropriate protease (thrombin, factor Xa and enter- 
okinase, respectively) can be used to cleave a fusion 
protein containing a corresponding cleavable linker. 
Cleavage by a protease (proteolysis) can occur within 
a cleavable linker or at the border of the linker and 
another component of a fusion protein. 

The product of the cleavage reaction of a 
selected fusion protein comprising the majority of 
the affinity ligand is referred to as the affinity 
ligand portion, and the product of the cleavage 
reaction comprising the selected polypeptide is 
referred to as the selected polypeptide portion of 
30 the fusion protein. These portions may comprise 

other parts of the encoded fusion protein (i.e., the 
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fusion protein encoded by the expression vector) , and 
such are also fusion proteins. For example, the 
selected polypeptide portion can include the 
modification recognition sequence. If so, and if the 
05 fusion protein has been modified (e.g., labeled), 
then the selected polypeptide portion is referred to 
as the labeled selected polypeptide portion. Thus, 
the term fusion protein refers to a protein, 
polypeptide, or glycoprotein comprising at least two 
10 of the components selected from the group consisting 
of an affinity ligand, modification sequence, 
cleavable linker sequence, selected polypeptide or 
additional sequence. 

Preferably, a cleavable linker is selected which 
15 does not cleave elsewhere within the fusion protein 
(e.g., in the selected polypeptide, affinity ligand 
or modification site) . m addition, the sequences 
encoding the cleavable linker are preferably located 
between those sequences encoding the affinity ligand 
20 and selected polypeptide, m this location, the 

encoded fusion protein can be cleaved to separate the 
affinity ligand portion of the fusion protein from 
the selected polypeptide portion where desired. 
Similarly, where recovery of a labeled selected 
" polypeptide portion is desired, the sequences 

encoding the cleavable linker will be on located in 
the vector either upstream or downstream of the 
sequences encoding both the modification recognition 
sequence and the selected polypeptide (e.g., as in 
30 A-C-M-X or X-M-C-A fusions) . 

Overlap of the sequences of portions of the 
fusion protein may occur provided the respective 
portions retain function. For example, in one 
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embodiment of the present invention, pARC^Rl) 59/60, 
discussed in more detail below, the affinity ligand 
(FLAG epitope} sequence and cleavable linker sequence 
(enterokinase site) overlap by one amino acid* The 

05 sequence of the FLAG epitope from N- to C-terminus is 
(Asp-Tyr-Lys-Asp) , while the enterokinase cleavage 
site in this embodiment is (Asp-Asp-Asp-Asp-I*ys) . 
within pAR(ARl) 59/60, these sequences overlap to give 
the FLAG peptide, an octapeptide of the sequence 

10 Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys, which retains the 
functions of the affinity ligand (e.g,, binding to a 
specific binding partner) and cleavable linker (e.g. , 
cleavage by enterokinase) . Similarly, other 
components of the fusion protein can overlap, 

15 providing their function is preserved. 

A modification recognition sequence is also pro- 
vided. This sequence, incorporated into the fusion 
protein, directs modification of the fusion protein. 
Modification recognition sequences can be 

20 incorporated into a fusion protein comprising a 
selected polypeptide which either is naturally 
modified or is not naturally modified- A 
modification recognition sequence for phosphorylation 
(i.e., a phosphorylation site) can be introduced into 

^ a vector and expressed as part of a fusion protein, 
for example. In the presence of a suitable protein 
kinase, the fusion protein comprising the 
phosphorylation (kinase) site will be phosphorylated. 
Other types of modifications directed by the presence 

30 of a peptide or polypeptide sequence are envisioned 
in the present invention. For example, glycosylation 



SUBSTITUTE SHEET 



wo 93/03157 



PCT/US92/06187 



-IT- 



OS 



10 



15 



reactions can be directed by a short peptide 
sequence. Similarly, fatty acylation of some 
proteins is directed by a short peptide sequence 
(Cys-Ala-Ala-X) . The identification of additional 
recognition sites and the identification and 
pxirification of the corresponding modification enzyme 
or enzyme complex will provide alternative protocols 
for modification of the fusion proteins. 

Modification of the fusion proteins is carried 
out using a modification enzyme or enzyme complex or 
other suitable means. In the case of a 
phosphorylation reaction, a suitable phosphorylation 
method is used. Typically, the phosphorylation of 
proteins is carried out by a protein kinase. Many 
such kinases have been described and have utility in 
the present invention (see e.g., Kemp, B.E., et al . . 
J. Biol. Chem., 252J 4888-4894 (1977); Edelman, A.M. 
®^_jyL- » Ann. Re v. Biochem . 56 : 567-613 (1987),- Glass, 
D.B. and E.G. Krebs, Ann. Rev. Pharmacol. Toxicol. 
20 20: 363-388 (1980); Hunter, T. and J. A. Cooper, Ann. 
Rev. Biochem. 54: 897-930 (1985); Hunter. T. , cell 
50: 823-829 (1987)). Known protein kinases catalyze 
the transfer of the 7 -phosphate group of ATP to the 
hydroxyl groups of serine and/or threonine residues 
on specific substrates or alternatively, catalyze the 
phosphorylation of tyrosine residues. Use of protein 
kinases of both specifities is contemplated in the 
present invention. Serine/threonine kinases include 
cyclic AMP (CAMP) dependent and cyclic GMP (cGMP) 
dependent protein kinases, and cyclic nucleotide- 
independent protein 
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kinases. A variety of serine/ threonine kinases have 
been identified including glycogen synthase kinase, 
phosphorylase kinase, casein kinase I casein kinase 
II, pyruvate dehydrogenase kinase, protein kinase C 
05 and myosin light chain kinase. 

Amino acid sequences present in natural 
substrates and artificial peptide substrates which 
are sufficient for activity as a protein kinase 
substrate have been identified (see e.g., Edelman, 
10 A.M- et al ., Ann. Rev> Biochem . 56 : 567-613 (1987); 
Glass, D.B. and E.G. Krebs, Ann. Rev. Pharmacol. 
Toxicol . 20 ; 363-388 (1980); Hunter, T- and J. A. 
Cooper, hrm. Rev. Biochro . 54 : 897-930 (1985)). These 
amino acid sequences, in addition to those described 
^5 belov, and related amino acid sequences which can be 
specifically phosphorylated, can be used as 
modification recognition sequences in vectors of the 
present invention. A corresponding protein kinase, 
capable of phosphorylating a polypeptide comprising 
-0 the selected kinase recognition secjuence, can be used 
to phosphorylate fusion proteins of the present 
invention. For example, the cAMP-dependent protein 
kinase (e.g., the catalytic subunit of the 
cAMP-dependent protein kinase from bovine heart 
muscle> can recognize the consensus amino acid 
sequence Arg-Arg-Xaa-Ser-Xaa, where Xaa is an amino 
acid, in a variety of substrates, resulting in 
phosphorylation of serine. As shown in the Examples, 
expression vectors of the present invention (e.g., 
0 PGEX-2TK derivatives, or pAR(ARI) 59/60 derivatives), 
incorporating a sequence which encodes a version of 
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the cAMP-dependent protein kinase recognition 
sequence (Arg-Arg-Ala-Ser-Val) , can direct the 
production of fusion proteins capable of being 
phosphorylated by cAMP-dependent protein kinase from 
bovine heart muscle. The Arg-Arg-Ala-Ser-Val 
sequence is also referred to as the HMK site (HMK, 
heart muscle kinase) . other specific amino acid 
sequences, such as Arg-Arg-Ala-Ser-Leu (Li et al . . 
Proc. Natl, sci. USA 86: 558-562 (1989) and the 
peptide Arg-Thr-Lys-Arg-Ser-Gly-Ser-Val, can be 
recognized and phosphorylated by this kinase. The 
phosphorylation sites can function at a terminal or 
internal location within a fusion protein. 

cGMP protein kinases have a substrate 
specificity that is similar, but not identical, to 
that of the cAMP-dependent protein kinases, 
comparative analysis of substrates has been made 
(Edelman, A.M. et al ., Ann. Rev. Biochem . 56: 
567-613, (1987); Glass> D.B. and E.G. Krebs, Ann. 
Rev. Pharmacol. Toxicol, 363-388 (1980); Glass, 

D.B. and E.G. Krebs, J. Biol. Chem. 254 : 9728-9738 
(1979)). The substrate specificities of the two 
types of casein kinases has also been studied, and 
the activity of peptide substrates has been compared 
(Marin, o. et al., Eur, j. Biochem. 160 : 239-244 
(1986); Sommercorn, J. and E.G. Krebs, J. Biol. Chem . 
262: 3839-3843 (1987); Kuenzel, E.A. et al . , J. Biol. 

262: 9136-9140 (1987)). For example, casein 
kinase II was shown to phosphorylate the synthetic 
30 peptide Ser-Glu-Glu-Glu-Glu-Glu. Additional peptide 
substrates were phosphorylated by casein kinase II 
(e.g., in decreasing order of activity. 
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Arg-Arg^Arg-Asp-Asp-Asp-Ser-Asp-Asp-Asp ; Arg-Arg- 
Arg-Glu-Glu-Glu-Ser-Glu-Glu-Glu ; Arg-Arg-Arg-Glu-Glu- 
Glu-Thr-Glu-Glu-Glu) . The substrate specificity of 
tyrosine kinases has also been studied (Hunter,. T. 
and J.A, Cooper, Ann> Rev, Biochin > 54 ; 897-930 
^5 (1985)). Candidate phosphorylation sites 

(recognition sequences) can be incorporated into 
synthetic peptides or into fusion proteins and 
assayed for activity as protein kinase substrates, 
using techniques similar to those described 
previously^ Because corresponding phosphatases 
exist, enzymatic removal of phosphate groups is also 
possible. 

The ability to modify the fusion proteins 
provides a convenient method of specifically labeling 
the fusion proteins with detectable radioactive or 
non-radioactive labels to produce a labeled fusion 
protein (e.g., a ["^^P] -labeled fusion protein). 
Cleavage of fusion proteins can produce a selected 
polypeptide portion comprising a modification 
sequence or an affinity ligand portion comprising a 
modification sequence, depending on the location of 
the cleavable linker in relation to these components. 
Thus, cleavage following labeling can produce a 
labeled selected polypeptide portion or a labeled 
affinity ligand portion, each of which is itself a 
fusion protein. Because the modification recognition 
sequence directs modification to a specific site in 
the protein, there is a high degree of control over 
the location and extent of modification. As shown in 
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the Examples, fusion proteins expressed and modified 
by the introduction of a detectable label retain both 
structure and function. 

The detectable label (e.g., radioactive, 
fluorescent or chemiluminescent) selected will 
determine by the nature of modification, the 
modification enzyme, and the intended use. The label 
will be incorporated into a moiety which is 
transferred to the fusion protein substrate. For 
phosphorylation reactions, [7 -labeled] ATP is used as 
phosphate donor (i.e., modification donor). As 
protein kinases transfer the 7-phosphate onto the 
substrate during the phosphorylation reaction, the 
label will be incorporated into the moiety 
transferred by the protein kinase. For example, to 
incorporate a radioactive phosphate label such as 

P, P or P, phosphate donors such as [7*^''"P]ATP, 
[t P]ATP, or [t^^P]ATP can be used. Alternatively, 
isotopes of sulfur, such as ^^S or ^^S isotopes, can 
be incorporated as the label in the [7-labeled] ATP, 
as for example in "^^S-labeled adenosine 
5' [7-thio] triphosphate. The term "phosphorylation" 
also refers to modification with such thiophosphate 
analogs or other 7 -labeled analogs of ATP. Other 
considerations such as specific activity, half -life, 
type of particle emitted and energy of radiation will 
influence selection of an appropriate radioactive 
label. In addition, a fluorescent moiety or 
chemiluminescent moiety incorporated into the 
7-phosphate. Detection methods for such labels are 
well known in the art. 
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In other types of modifications (e*g,, glyco- 
sylation, fatty acylation) , a radioactive isotope, 
chemiluminescent or fluorescent dye, or other 
nonradioactive label, could be incorporated into the 

05 modification donor so that it is transferred to the 
fusion protein during the modification reaction. For 
example, a biotin adduct of a modification donor 
could be linked to a fusion protein. In the present 
method, the site of addition of the biotin label is 

10 controlled by the location of the modification 
recognition sequence. 

Specific Embodiments 

Xn one embodiment of the present invention, 
glutathione-S-transf erase (GST) from the psurasitic 

15 helminth Schi stosoma i aponica (Mr— 26,000) is selected 
as the affinity ligand. However, GST genes from 
other sources, such as other bacteria or mammalian 
organisms, can be used. In addition, a portion of a 
GST gene encoding a portion of GST capable of binding 

20 a specific binding partner, such as the substrate 

glutathione, can be used. In particular, expression 
vectors capable of expressing S. japonicum GST-fusion 
proteins (i.e., a fusion protein comprising a GST 
affinity ligand) were constructed. The parent vector 

25 is named pGEX-2TK, and vectors derived from pGEX-2TK 
by the insertion of a nucleotide sequence encoding a 
selected polypeptide, are referred to with the prefix 
pGTK-. The construction of pGEX-2TK and pGTK- 
plasmids is described in Examples 1 and 2 and in 

3° Figure 1- 
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PGEX-2TK Is a derivative of pGEX-2T, the latter which 
is described by D.B. Smith in EP 0,293,249, p\ablished 
November 30, 1988, PCT/AU88/ 00164 , published December 
1, 1988, and New Zealand Patent No. 224,663, issued 
November 27, 1990, and by D.B. Smith and K.S. Johnson 
in Gene 67: 31-40 (1988). The teachings of EP 
0,293,249, PCT/AU88/ 00164, New Zealand Patent No, 
224,663, and smith and K.S. Johnson ( Gene 67: 31-40 
(1988)) are herein incorporated by reference. 

PGEX-2TK encodes a fusion protein having GST as 
an affinity ligand at the amino terminus, followed by 
a thrombin cleavage site (cleavable linker), in 
addition, a modification recognition sequence for a 
protein kinase was incorporated downstream of the 
cleavable linker. In particular, a sequence of the 
structure Arg-Arg-Xaa-Ser-Xaa , where Xaa is an amino 
acid, was selected (Arg-Arg-Ala-Ser-Val) . This 
sequence is a phosphorylation site recognized by 
cAMP-dependent protein kinases such as the catalytic 
domain of the cAMP-dependent protein kinase from 
bovine heart muscle. In pGEX-2TK, the sequence which 
encodes the thrombin cleavage site is followed by 
several restriction sites for the insertion of 
sequences encoding a selected polypeptide. pGEX-2TK 
retains the inducible tac promoter for expression, 
three in frame stop codons, a selectable marker 
(ampicillin resistance), origin of replication, and 
laclq gene present in pGEX-2T. 

When a secjuence which encodes a selected 
polypeptide is inserted in frame into the pGEX-2TK 
expression vector downstream of the phosphorylation 
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site, a GTK- fusion protein comprising the selected 
polypeptide can be produced. The GTK-fusion protein 
(where, from N- to C-terminus, G is a GST affinity 
ligand, T is a thrombin cleavage site, and K is a 
phosphorylation site for a protein kinase) is encoded 
by a pGTK-vector, can be produced upon expression in 
a suitable E« coli host. These fusion proteins can 
be captured on immobilized glutathione, and labeled 
by phosphorylation with cAMP-dependent protein 
^0 kinase, using [7-"^^P]ATP as a modification donor. As 
shown in the Examples, several different cDNAs were 
expressed from pGEX-2TK, and labeled as described. 
The studies described in Examples 1*4 indicate that a 
selected polypeptide present in a GTK-fusion protein, 
encoded by a pGTK-type vector, such as pGTK-^RB 
(379-792) , can retain both structure and function. 
Fiirthermore,. the labeling (phosphorylation) procedure 
was efficient and did not alter the structure or 
function of the selected polypeptides. 

In another embodiment of the present invention, 
the FLAG epitope is selected as the affinity ligand. 
Vector pAR(ARI) 59/60, shown in Figure 2, is an 
example of this type of expression vector. Vector 
pAR(ARX) 59/&0 is a derivative of bacterial expression 
plasmid pET3a (pAR3040) described by Studier et al. , 
Meth. Enzymol^ 185 ; 60-89 (1990), the teachings of 
which are herein incorporated by reference « The 
construction of pAR(ARI) 59/60 is described in detail 
in Example 5. In this construct, the FLAG peptide is 
30 located adjacent to the N-terminal initiator 
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methionine (Figure 2) . The FLAG octapeptide 
(Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys) comprises the FLAG 
epitope (Asp-Tyr-Lys-Asp) affinity ligand and an 
overlapping enterokinase cleavable linker 
05 (Asp-Asp-Asp-Asp-Lys) . Cleavage with enterokinase 
typically occurs precisely after the terminal lysine 
(underlined above) of the FLAG octapeptide. The FLAG 
octapeptide is followed by a kinase recognition site. 
In particular, in pAR(ARI) 59/60, the site is 
Arg-Arg-Ala-Ser-Val (HMK) , recognized by cAMP- 
dependent protein kinase. A unique EcoRI restriction 
site located downstream of the sequence encoding the, 
phosphorylation site can be used for the insertion of 
a sequence encoding a selected polypeptide. In 
plasmids derived from pAR(ARI) 59/60, referred to with 
the prefix pFEK- F for FLAG, E for enterokinase, and 
K for the phosphorylation recognition site) , in which 
a sequence encoding a selected polypeptide has been 
inserted, the encoded fusion protein, referred to 
herein as a FEK-fusion protein, has the following 
general structure for N- to C-terminus: Met- [FLAG/ 
enterokinase cleavable linker] -phosphorylation 
site-selected polypeptide. An alanine residue is 
located between the FLAG peptide and HMK site (see 
25 Figure 2) . m addition, the EcoRI site encodes a 
Glu-Phe dipeptide. Depending on the strategy for 
insertion of a sequence encoding a selected poly- 
peptide, additional residues (e.g., encoded by a 
linker or PCR primer) may be inserted between the 
selected polypeptide and the modification recognition 
(HMK) site. In many cases, however, only 17 amino 
acids will be fused to the selected polypeptide. 



20 



30 



BNSDOCID: <WO_93031 57A1J_> 



suBsmruTE sheet 



wo 93/03157 



PCr/US92/06187 



-26- 

pFEK-plasmids can direct the expression of FLAG- 
fusion proteins (i.e., fusion proteins having a FIAG 
epitope affinity ligand) from the T7 polymerase 
promoter in an appropriate host (e.g., a bacterial 
05 cell capable of constitutive or inducible expression 
of the T7 polymerase; for examples of suitable hosts 
and induction protocols, see Studier et al. , Meth> 
Enzymol> 185 ; 60-89 (1990)). The construction of 
several pFEK-plasroids is described in Example 5. The 
encoded fusion proteins were expressed in a bacterial 
host, and a lysate was prepared. Partially purified 
fractions containing each fusion protein were 

subjected to phosphorylation with the catalytic % 
subunit of the cAMP-dependent protein kinase using 

15 [y-'^^PJATP as a (labeled) modification donor. The 
FEK-fusion proteins were labeled to high specific 
activity to give £ P] -labeled FEK-fusion proteins 
(which are also [^^P]-labeled FLAG-fusion proteins) . 
Note that vectors of the present invention 

20 related to pGTK- and pFEK- vectors can be constructed 
lacking the cleavable linker sequence. These vectors 
would have a pGK- or pFK- prefix. A fusion protein 
comprising a selected polypeptide expressed from a 
pGK- or pFK-vector is referred to as pGK-fusion 

25 protein or pFK- fusion protein, respectively. 

A variety of commercially available vectors 
comprising an affinity ligand and cleavable linker 
are available which can be modified by the 
introduction of a modification site (e.g. , a 
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phosphorylation site) , and where required, one or 
more restriction sites for the in-frame insertion of 
a selected polypeptide using recombinant DNA 
techniques. For example, (i) the pGEX-3X vector 
°5 (Pharmacia) comprising a GST affinity ligand and 
Factor Xa cleavable linker, (2) the pMAL-C vector 
(Biolabs) comprising a malB affinity ligand and 
Factor Xa cleavable linker, (3, 4) the protein A gene 
fusion expression vectors, pRlT2T and pRlT5 
10 (Pharmacia), comprising a protein A affinity ligand, 
(5, 6) as well as pDS and pQE-vectors (Qiagen) , 
comprising a histidine hexamer affinity ligand, could 
be modified in this way. The vector pRIT2T, vector 
PRIT5, pDS and pQE-vectors, could be further modified 
by the insertion of a sequence encoding a cleavable 
linker. Convenient affinity matrices for fusion 
proteins encoded by the foregoing were discussed 
above . 



20 



Methods of Pr oducing a Modified Fusion Protein 

The present invention further relates to methods 
for producing a modified fusion protein. In 
particular, a rapid and convenient method for 
labeling a fusion protein using a radiolabel or 
non-radioactive label donated by [7-labeled]ATP is 
25 provided. m addition, the methods used are mild; a 
feature which is important for preservation of the 
structure and biological activity (e.g., binding, 
antigenicity, activity) of the components of the 
fusion protein, and of the selected polypeptide in 
30 particular. 
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For example, fusion proteins of the present 
invention comprising an affinity ligand portion and a 
selected polypeptide portion are expressed in a host 
cell carrying a vector of the present invention. The 

05 host cells are propagated under conditions which 
permit expression of the vector. For example, 
expression from the particular pGTK- and pFEK- 
vectors described in the Examples requires induction 
by IPTG. The host cells are lysed using known 

10 technicpies to obtain a lysate containing the fusion 
protein. 

In one embodiment, the lysate can be crudely 

fractionated as in Example 5 and directly labeled in 

32 

the presence of [7-labeled]ATP (e.g., [7 - PJATP) , 
^5 and a cAMP-dependent protein kinase, such as the 
catalytic subunit of the cAHP-dependent protein 
kinase from bovine heart muscle. A suitable 
foirmulation for a kinase reaction buffer and reaction 
stop buffer is given below. 

^0 In another embodiment |. the fusion protein is 

modified (e.g. , phosphorylated) while bound to an 
affinity matrix. The fusion protein present in the 
lysate is captured on an affinity matrix. This step 
is carried out by contacting the lysate with an 

^ appropriate affinity matrix (i.e., an affinity matrix 
comprising a specific binding partner of the affinity 
ligand present in the fusion protein) , under 
conditions which permit binding of the affinity 
ligand portion of the fusion protein to the affinity 

30 matrix. Suitable conditions for a variety of 

affinity ligands and matrices are known in the art. 
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For example, GST-fusion proteins (e.g., a 
GTK-fusion) can be captured on immobilized 
glutathione as an affinity matrix. As shown in 
Example 2, the GST— fusion protein can be captured on 

05 the affinity matrix in the presence of a wash buffer, 
(which was also used as the lysis buffer) • The 
components of the wash buffer permit binding of the 
GST portion of the fusion proteins to the matrix via 
the specific binding partner (glutathione) . In one 

10 embodiment, the wash buffer comprises a buffer such 
as Tris, salt (e.g., NaCl) , a chelator (e.g., EDTA) , 
and a non-ionic detergent such as nonidet P-40. 

As discussed above, FEK-fusion proteins can be 
captured on an anti-FLAG antibody affinity matrix. 

15 When the FLAG peptide is internal (e.g., not 

immediately at the N-terminus, as in pAR(AiRI) 59/60) , 
anti-FLAG M2 antibody can be used as the specific 
binding partner. Suitable conditions and 
formulations (e.g., for wash buffer and elution 

20 buffer) for anti-FLAG M2 affinity chromatography can 
be obtained from International Biotechnologies, Inc. 

Once bound, the fusion proteins can then be 
washed in order to remove unbound material (e.g., 
contaminating proteins other than the fusion 

25 protein) . The wash buffers described above are 
suitable for this purpose. 

The affinity matrix with bound fusion protein 
attached is then equilibrated (contacted) with a 
reaction buffer suitable for the modification 

30 reaction. In a phosphorylation reaction, for 
example, a kinase buffer such as HMK buffer is 



3NSD0CID: <WO_93031 57A1J_> 



SUBSTITUTE SHEET 



wo 93/03157 



PCT/US92/06187 



-so- 
selected. The particular kinase reaction buffer vill 
be adapted to the kinase selected. Suitable reaction 
buffers for a variety of characterized kinases are 
^v^^ilable. HMK reaction buffer, suitable for use 
with the cAMP-dependent protein kinase from bovine 
heart muscle (HMK), comprises (1) a buffering agent 
such as Tris, (2) a reducing agent such as 
dithiothreitol (DTT> , (3) a salt such as NaCl, and 
(4) a source of magnesium ions or other appropriate 
ion, such as MgCl^. Although the DTT can be added 
separately, it is present in the kinase reaction 
buffer. In addition, a modification enzyme and 
modification donor are added. in one particular 
phosphorylation reaction, a protein kinase prepara- 
tion comprising the HMK enzyme is added, and [7- 
labeledJATP. Typically, the protein kinase will be 
diluted in a suitable solution prior to addition. In 
the examples, a variety of fusion proteins were 
efficiently radiolabeled by phosphorylation using HMK 
reaction buffer, the catalytic subunit of the HMK and 
[7- P]ATP as a modification donor. 

The conditions appropriate for phosphorylation 
of the bound fusion protein to produce a bound 
labeled (e.g., radiolabeled) fusion protein will vary 
with the protein kinase selected. Typically, the 
steps subsequent to culturing the host cell are 
carried out at 4 **C. However, phosphorylation 
reactions with the catalytic subunit of the 
cAMP-dependent protein kinase from bovine heart 
muscle, have previously been carried out at 37 «>c* 
In the present method, phosphorylation can be carried 
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out at 37 **C. However, in another embodiment of the 
present invention, efficient phosphorylation is 
carried out at 4 **C, This temperature can preserve 
the activity of fusion proteins, and of the selected 

05 polypeptide portion in particular. 

Optionally, the reaction can be quenched by 
addition of a stop buffer comprising a component 
capable of inhibiting the modification reaction. The 
component capable of inhibiting the reaction will 

10 vary with the modification enzyme, but can include 
inhibitors (e.g., reaction products, competitors), 
chelators, or other agents which stop the reaction. 
It can be desirable to stop a reaction when, for 
example, in subsequent steps, undesired modifications 

15 can occur. However, wash steps can serve to remove 
modification enzymes. 

For example, in a phosphorylation reaction with 
cT^P-dependent protein kinase, an HMK stop buffer can 
be added, comprising a buffer such as sodium phos- 

20 phate, a reaction inhibitor such as sodium pyrophos- 
phate, and a chelator such as EDTA. HMK stop buffer 
can optionally contain a carrier such as bovine serum 
albumin or glycerol. As a reaction product of a 
phosphorylation reaction, sodium pyrophosphate 

25 inhibits the kinase reaction. In addition, chelating 
Mg^"*" ions inhibits the reaction. 

An advantage of the present method is that 
unincorporated label is easily removed by washing the 
affinity matrix with bound labeled fusion protein. 

30 The wash buffers described above, which permit 

binding of the affinity ligand to the affinity matrix 
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via the specific binding partner can be used in this 
step. Other methods for labeling proteins with the 
HMK enzyme have relied upon extensive dialysis to 
remove unincorporated label (e.g., Zhao, X.-X. et 
05 al. , Analvt, Biochem > 178 ; 342-347 (1989)). The 
removal of unincorporated label by washing is 
advantageous because extensive dialysis can 
compromise the function of some proteins and results 
in the production of large quantities of contaminated 
^0 dialysis buffer. 

If desired, the modified fusion protein may be 
isolated for use by releasing the fusion protein from 
the affinity matrix by washing with a suitable 
elution buffer. Elution buffers were discussed 
^5 ^^bove. For example, a labeled GST-fusion protein 
such as a GTK-f usion protein can be eXuted from 
immobilized glutathione with an elution buffer 
comprising glutathione (e.g., reduced glutathione). 
In one embodiment, an elution buffer coiaprising 
^0 reduced glutathione, a buffer such as Tris, and a 
salt such as NaCl is used to release labeled 
GST-fusion proteins from the affinity matrix. 

In the case where a cleavable linker is present, 
after elution ,^ the fusion protein can be cleaved in 
'5 vitro to free the modified (e.g., labeled) 

polypeptide portion. Cleavage is accomplished by 
contacting the fusion protein with an appropriate 
specific protease under conditions which permit the 
cleavage reaction. Such conditions for cleavage by 
° enterokinase, thrombin and factor Xa, for example, 
are known. The affinity ligand portion or any 
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uncleaved product can be removed by adsorption on the 
appropriate affinity matrix in the presence of wash 
buffer for example. The modified or labeled selected 
polypeptide portion can be recovered in the 

05 supernatant. 

Optionally, prior to elution from the affinity 
matrix, the modified polypeptide portion can be 
released from the affinity ligand portion by cleaving 
the modified fusion protein at the cleavable linker. 

10 An example protocol for cleavage by a specific 

protease on a column is provided in Abath, F. and A. 
Simpson, Biotechniques 10 : 178 (1991), the teachings 
of which are herein incorporated by reference. 

Smith (EP 0,293,249; PCT/AU88/00164 ; NZ 224,663) 

15 and Smith and Johnson ( Gene 67 ; 31-40 1988)) also 
describe conditions for capture of GST-fusion 
proteins on immobilized glutathione, washes, elution 
and cleavage protocols. 

In one embodiment of the present invention, a 

20 kit for preparing a labeled fusion protein is 

provided, comprising (1) an affinity matrix to which 
a portion of the fusion protein can bind, (2) a wash 
buffer which permits binding of the fusion protein to 
the affinity matrix, (3) a modification reaction 

25 buffer such as a protein kinase reaction buffer, and 
(4) a modification enzyme preparation such as a 
protein kinase preparation. Optionally, the kit can 
contain a reaction stop buffer or an elution buffer 
comprising a release components The nature of these 

30 elements has been explained above in more detail. 
In another embodiment, a kit can comprise a 
vector of the present invention (e.g., pGEX-2TK or 
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pAR£ARlj59/60) . The vector can be present in a 
suitable host cell or free from the host cell. 
Furthermore, the kit comprises each of the elements 
(1) through (4) of the kit described above, and if 
desired, a reaction stop buffer. in other 
embodiments, a modification donor such as 7 -labeled 
ATP can also be included in the kit. 

In a phosphorylation reaction, a protein kinase 
preparation is provided comprising a protein kinase. 
In the protein kinase preparation, the kinase may be 
in the form of a precipitate or solution. The kinase 
can be dissolved or diluted as necessary prior to 



use. 



The aff init:y matrix can be present in a suitable 
containing means (e.g., a vial or cartridge), m one 
embodiment, the affinity matrix is provided in a 
cartridge unit, the combination of which acts as a 
column. Various components (e.g., the lysate, wash 
buffers, label) can be inserted into the cartridge 
and passed through the affinity column to facilitate 
the labeling procedure. In one embodiment, 
immobilized glutathione can be incorporated into a 
cartridge unit for capture of GST-fusion proteins. 

Uses o f the Present Invention 

Vectors of the present invention can direct the 
expression of large amounts of a fusion protein in a 
host cell. These fusion proteins can be conveniently 
labeled using the methods described, and have many 
uses in research, diagnostic and therapeutic 
applications. 
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For example, radiolabeled proteins are useful in 
imaging, in one embodiment, radiolabeled fusion 
protein or radiolabeled selected polypeptide portion 
can be used to locate cells carrying a ligand 
05 recognized by the selected polypeptide (e.g., a 

receptor, cell surface component, or antigen). For 
example, a growth factor can be used as the selected 
polypeptide, and a labeled fusion protein of the 
present invention comprising the growth factor can be 
10 used to detect cells carrying a receptor for the 
growth factor. 

Similarly, incorporation of a suitable 
radiolabel permits the use of fusion proteins of the 
present invention in targeted radiotherapy. in other 
15 words, the selected polypeptide (e.g., a hormone, 
growth factor, antibody) present in the fusion 
protein can seek out specific cells and damage or 
destroy them due to the attached radiolabel. An 
isotope with the desired energy and half-life can be 
20 selected for this purpose. Other labels incorporated 
by methods of the present invention and capable of 
killing cells can also be used. 

For example, a labeled antibody or portion 
thereof (e.g., F Fab, etc.) produced by the method 
of the present invention could be used in this 
manner. Such fusion proteins would be useful for 
many other applications as well. The term antibody 
as used herein refers to antibodies such as chimeric 
antibodies, single chain antibodies, bifunctional 
'° antibodies and other antibody variants, including 
individual chains (e.g., a heavy chain). A variety 
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of methods for expressing immunoglobulins are 
available (see e.g., Skerra, A* and A. Pluckthun, 
Science 2401 1038-1041 (1988)). 

In one embodiment, antibody chains are expressed 

05 as fusion proteins of the present invention in E. 
coli . A signal sequence (e.g., ompA, phoA) is fused 
to the N-terminus of an antibody chain, which is 
followed by at least one modification recognition 
sequence,, an optional cleavable linker, and an 

10 affinity ligand. On expression in E. coli ,. the 

signal sequence is cleaved from the fusion protein, 
freeing the amino terminus of the antibody chain, and 
preserving the binding function of the variable 
region^ Fusion genes encoding both chains can be 

15 expressed in this manner from two vectors or a single 
vector encoding two fusion genes. Alternatively, 
either the heavy or light chain could be expressed as 
a fusion protein of the present invention and the 
complementary chain could be expressed in the same 

20 cell using typical antibody expression vectors. 

In addition, as shown herein, labeled fusion 
proteins of the present invention can be used as 
probes. As shown in Example 4, labeled fusion 
proteins such as P-6TK-fusion proteins, can be used 
as probes for screening expression libraries for 
proteins capable of binding the selected polypeptide. 
For example, a fusion protein of the present 
invention comprising a selected polypeptide is 
expressed, purified by virtue of the affinity ligand, 
and specifically labeled at the modification 
recognition sequence. The labeled, homogeneous 
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fusion protein is then incubated with filters on 
which proteins expressed by individual plaques of a 
target library have been immobilized. Following this 
hybridization step, the filters are washed, and 

05 processed for identification (i.e., by detection of 
the label) of plaques which produce recombinant 
proteins capable of specifically interacting with the 
labeled fusion protein probe. 

In particular, a cDNA encoding a portion (the E7 

10 binding domain) of the retinoblastoma susceptibility 
gene product (pRB) was prepared as a GTK-fusion 
protein and used to screen a library for proteins 
which interact with pRB. A number of positive clones 
were isolated. One of these clones, designated 

15 RBAPl, displays properties expected of a cellular 
protein capable of interacting with pRB. Labeled 
fusion proteins of the present invention comprising a 
selected polypeptide can also be used as probes in 
Western blot formats (Example 4) . 

In addition, as shown in Example 4, labeled 
fusion proteins of the present invention can be used 
to identify proteins present in complex mixtures 
(e.g., whole cell extracts), which are capable of 
interacting with the selected polypeptide. The 

25 

fusion protein^ can be used in protocols to isolate 
the interacting proteins. For example, the 
interacting proteins can be captured by the selected 
polypeptide portion of a labeled fusion protein which 
is bound to an affinity matrix. Subsequent cleavage 
30 at a cleavable linker could release the complex of 
the interacting protein and a labeled selected 
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polypeptide portion for further analysis ♦ 
Alternatively, a purified labeled fusion protein 
probe can also be used to determine the amount of or 
to detect an interacting protein or other substance 
in a complex mixture using immunoassay techniques, 
for example. 

In one embodiment, the binding of a specifically 
interacting protein (e*g, , an antigen, hormone) or 
other substance to an immobilized (e.g. , bound to an 
affinity matrix) fusion protein of the present 
invention can induce a change in the attributes of 
the bound fusion protein. Other substances include, 
but are not limited to agents such as a drug, toxin, 
substrate or other ligands capable of interacting 
with the selected polypeptide portion of the fusion 
protein. The difference between the unbound versus 
the bound state could be taken advantage of, as for 
example, in an assay format. For example, the 
presence of the interacting protein or other 
substance in a complex mixture (e.g., blood) could be 
monitored. For example, binding of an interacting 
protein could induce a conformational change that 
shields the labeling (e.g. phosphorylation) site. 
The complex mixture containing an interacting protein 
is contacted with the fusion protein, which is bound 
to the affinity matrix. The combination is washed, 
equilibrated with a suitable reaction buffer and 
subjected to the labeling procedure. The extent of 
labeling is compared to a control which lacks the 
interacting polypeptide and the presence of the 
interacting protein in the complex mixture is 
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indicated by inhibition of labeling- By comparison 
with the degree of inhibition achieved by a standard, 
such a procedure can also provide information 
regarding the quantity of interacting protein which 

05 is present. 

The vectors and methods of the present invention 
can facilitate the characterization of cDNA clones. 
For example, an uncharacterized cDNA can be inserted 
into a vector of the present invention allowing rapid 

10 purification of the protein, without specific 

knowledge of its properties. The protein can be 
labeled with the maintenance of biological activity, 
and the labeled fusion protein can be used to detect 
potential interactions with other cellular proteins. 

15 The present invention will now be illustrated by 

the following Examples, which are not intended to be 
limiting in any way. 

Introduction to Examples 1-4 

Recently it has been demonstrated that the 

20 adenovirus ElA protein, SV4 0 and polyomaviral T 
antigens, and the human papilloma virus E7 protein 
can bind to the retinoblastoma susceptibility gene 
product, pRB. Neither these viruses nor these 
proteins are thought to be closely related to one 

25 another, yet each of these proteins shares a short, 
homologous, colinear, transf oinning sequence, typified 
by ElA conserved region 2 {CR2) , which appears to be 
responsible for high affinity binding to pRB. The 
region of pRB which interacts with this motif has 
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been mapped. To date, all spontaneously occurring, 
loss of function, pRB mutations which do not grossly 
compromise pRB stability, map to this region. This 
led to the hypothesis that pRB, in the course of 

05 regulating cell growth, must interact with one or 
more cellular proteins bearing a seguence 
structurally resembling the viral pRB-binding motif. 
If such a model were correct, one would predict that 
pRB inactivation might be achieved either as a result 

10 of competing complex formation with one of the above 
viral oncoproteins, or as a result of RB mutations 
affecting the T/ElA/E7-binding domain. 
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Example l 

Construction of pGEX-2TK 

Two synthetic oligonucleotides were designed, 
which, on annealing, encode a cAMP-dependent protein 
kinase recognition site and three restriction sites. 
Codon utilization for the synthetic duplex was based 
on the prokaryotic codon utilization data of 
(Grantham, R. et al . , Nucleic Acids. Res . 9: r43-r74 
(1981) ) . 

The oligonucleotides were synthesized using 
standard techniques and were annealed in vitro (Gait, 
M.C.J. , (1984) Oligonucleotide Synthesis; A 
Practical Approach. (I.R.L. Press; Oxford)). The 
resulting duplex has 5 '-overhangs compatible with 
15 BamHI- and EcoRI-cut DNA and is shown below: 

ArgArgAlaSerVal 
5 ' -GATCTCGTCGTGCATCTGTTGGATCCCCGGG 

AGCAGCACGTAGACAACCTAGGGGCCCTTAA-5' 

Plasmid PGEX-2T (Pharmacia) was linearized with BamHI 
and EcoRI, and the vector fragment was ligated to the 
synthetic duplex to make plasmid pGEX-2TK. 
Incorporation of the duplex into pGEX-2T was 
confirmed by restriction and DNA sequence analysis. 
DNA sequencing was performed using a Sequenase 2.0 
kit (United States Biochemical Corp.) with the 
protocol provided by the manufacturer. The structure 
of PGEX-2TK in the region surrounding the kinase site 
is shown in Figure i. 
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As can be seen in Figure 1, the incorporation of 
the synthetic duplex DNA into pGEX-2T resulted in the 
insertion of codons for a cAMP-dependent protein 
kinase recognition sequence, Arg-Arg-Ala— Ser-Val, 
05 imiaediately downstream of the thrombin recognition 
site present in parent plamid pGEX-2T. In addition, 
the synthetic duplex encoded a multiple cloning site 
(MCS) downstrecun of the kinase recognition sequence, 
such that the insertion of the duplex led to the 
regeneration of the MCS of the parent plasmid, with 
restriction sites for BamHI, Smal, and EcoRI- As in 
the parent plasmid, the MCS is followed by a sequence 
with stop codons in all three reading frames. 

Example 2 

15 Generation of "^^P-GST Fusion Proteins 

Construction of pGEX-2TK Vectors Encoding RB Proteins 

RB cONAs encoding residues 379-792 or residues 
379-928 of the retinoblastoma susceptibility gene 
product, both of which span the T/ElA-binding region, 

20 were subcloned into pGEX-2TK. In addition, an RB 
cDNA encoding residues 379-928, having a naturally 
occurring, loss of function, RB point mutation 
(379-928 f 70eF) , which is known to abrogate T/EIA 
binding, was subcloned into pGEX-2TK. These pGEX-2TK 

^ recombinants are named pGTK-RB (379-792) , 

pGTK-RB (379-928) and pGTK-RB (379-928 ; 706F) , 
respectively. The encoded fusion proteins are named 
GTK-RB (379-792) , GTK-RB (379-928) and 



3NSDOC(D: <WO ^9303157A1 J_> 



SUBSTITUTE SHECT' 



wo 93/03157 PCr/L'S92/06187 



-43- 

GTK-RB(379-928;706F) , respectively. These RB fusion 
proteins are also referred to as GST-RB fusion 
proteins or RB fusion proteins. 

Two of the RB cDNAs had been generated in an 
05 earlier study by Kaelin et al . (Kaelin, W.G. et al . . 
Mol. Cel l. Biol . 10(7) : 3761-3769 (1990)), 
incorporated herein by reference, using the 
polymerase chain reaction. The RB cDNA inserted into 
PGEX-2TK to make. pGTK-RB( 379-928) corresponds to RB 
10 deletion RB dl 1-378 (Kaelin, W.G. et al . . Mol. Cell. 
Biol. 10(7) ; 3761-3769 (1990)). The RB cDNA inserted 
into PGEX-2TK to make pGTK-RB (379-792) corresponds to 
RB deletion RB dl 1-378 ;793-928 . Residue 379 is a 
methionine residue. 
15 Each amplimer used to generate the cDNAs 

contained a BamHI site, such that the resulting PCR 
product, upon digestion with BamHI, could be ligated 
in frame into the unique BamHI site in pGEX-2T 
(Pharmacia). The 3' amplimer contained a TGA stop 
'0 codon as well. The PCR fragments encoding these two 
CDNAs were each cleaved with BamHI and cloned into 
PGEX-2T, which had been linearized with BamHI, and 
treated with calf intestinal phosphatase. The 
resulting constructs were named pGT-RB (379-928) and 
5 pGT-RB (379-792) . The cDNAs were cleaved from the 
latter constructs using BamHI, and were each 
subcloned into pGEX-2TK, which had been cleaved with 
BamHI. The resulting plasmids are named 
pGTK-RB (379-928) and pGTK-RB (379-792) . Two 
^ additional amino acids were incorporated into the 
sequence at the BamHI site due to the structure of 
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the PGR primers used to clone both cDNAs^ such that 
the amino acid sequence at the junction of the 
phosphorylation (kinase) site and the first RB 
residue (Met^^^) isr 



C-Arg-Arg-Ala-Ser-Val-Gly-Ser-Ala-Thr-Met^^g-) . 

The mutctnt RB cDNA and referred to herein as 
either RB(379-928;706F) or RB(379-928 ;pin706) 
corresponds to the cDNA designated (379-928 ;pin 706) 
constructed by Kaelin et al . (Kaelin, W.G. et al > ^ 
10 Cell 64 ; 521-532 (1^91) ) f also incorporated herein by 
reference. The mutant cDNA fragment was generated by 
PGR of reverse-transcribed mRNA from cells containing 
the mutation. The PGR product was cleaved at the 
unique Ncol and Bsml sites in the RB gene (which span 
15 the 706F mutation), and ligated into pGT-RB (379-928) , 
from which the wild type RB cDNA Ncol-Bsmi segment 
had been excised- The resulting plasmid, 
pGT-RB(379-928;706F> , encoded the 379-928 ;706F cDNA. 
The plasmid was cleaved with BconHI to release the 
cDNA, and the cDNA fragment was subcloned into 
pGEX-2TK which had been cleaved with BaunHI to make 
pGTK-RB (379-928 ; 706F) . The amino acid sequence at 
the junction of the phosphorylation (kinase) site and 



the first RB residue (Met^^^) is also 



(-Arg-Arg-Ala-Ser-Val-Gly-Ser-Ala-Thr-Met^^^- ) . 

The products encoded by all three cDHAs have 
been tested previously for their ability to bind to 
SV40 T antigen, the adenovirus ElA gene product, and 
putative cellular RB-binding proteins, when expressed 
30 as pGEX-2T encoded GST fusion proteins* The fusion 
proteins encoded by pGT-RB (379-928) and 
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pGT-RB( 379-792) were able to bind to T antigen, EXA, 
and the putative cellular RB-binding proteins, while 
the mutant protein encoded by pGT-RB (379-928 ;706F) 
did not (Kaelin, W*G. et al . , Cell 64 ; 521-532 
(1991)). 

Expression and Purification of GST-Fusion Proteins 

The expression of the pGEX-2TK-encoded protein 
and recombinant pGEX-2TK-encoded GST fusion proteins 
in E- coli (DH5a; Bethesda Research Laboratories), 
and the subsequent recovery of the proteins on 
glutathione 3epharose was carried out essentially as 
described by Smith and Johnson (Smith, D.B. and 
Johnson, K.S., Gene 67: 31-40 (1988); Kaelin, W,G. et 
al,. Cell 64; 521-532 (1991)). Fresh overnight 
cultures of E, coli DH5a , transformed with either 
PGEX-2TK or PGEX-2TK recombinants, were diluted 1:10 
in Luria-Bertani (LB) medium containing ampicillin 
(100 //g/ml) and incubated for a total of 5 hours at 
37 »c, with shaking. After 1.0 hour of growth at 37 
**C , isopropyl-)9 -D-thiogalactopyranoside ( IPTG ; 
Bethesda Research Laboratories) was added to a final 
concentration of 0.1 mM. 

For analysis of total bacterial protein content, 
aliquots of each bacterial culture were pelleted in a 
microcentrifuge, were boiled in urea-SDS cracking 
buffer (0.01 M sodium phosphate [pH 7.2], 1% 
^-mercaptoethanol, 1% SDS, and SM urea), and were 
loaded onto an SDS-polyacrylamide gel. Proteins were 
visualized by Coomassie blue staining. 

For phosphorylation of protein and/ or protein 
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recovery using glutathione sepharose (Pharmacia) , 
bacterial cultures were pelleted at 5000 x g for 5 
min at 4 and resuspended 1/10 the original 

culture volume of NETN (20 mM Tris [pH 8.0], 100 inM 

05 NaCl, 1 mM EDTA, 0.5% Nonidet P-40} . The bacteria 
were then lysed on ice by mild sonication and 
subjected to centrifugation at 10,000 x g for 5 min 
at 4 «^C. The clarified bacterial sonicates, 
containing the pGEX-2TK-encoded protein or relevant 

10 GST fusion protein, were rocked for 15-30 minutes at 
4 •'C with glutathione sepharose (20-3 0 pl/ml 
bacterial sonicate) . The glutathione sepharose beads 
(Glutathione Sepharose 4Br Pharmacia) ) had been 
washed three times and resuspended 1:1 v/v in NETN 

15 (see above) supplemented with 0.5% non-fat powdered 
milk prior to use. This step and subsequent steps 
were also carried out at 4^C. 

Phosphorylation, of Proteins 

^0 The sepharose beads, with bound protein, were 

then washed three times with NETN followed by one 
wash with IX HMK buffer (20 mM Tris pH [7.5], 100 mM 
NaCl, 12 mM MgCl^) • The supernatant was aspirated 
with a 23G needle and the sepharose was resuspended 

25 in 2-3 bead volumes of IX HMK buffer containing 1 
unit/#il of the catalytic subunit of cAMP-dependent 
protein kinase (Sigma Chemical Co., St. Louis, MO), 
1 fi^CL/fil -^^P-tATP (6000 Ci/mMole, 10 mCi/ml, New 
England Nuclear) , and 1 mM DTT* The kinase reaction 
was allowed to proceed for 30 minutes at 4 «C, with 
periodic agitation of the sepharose to maintain a 
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suspension. The reaction was terminated by the 
addition of 1.0 ml of HMK stop buffer (10 xnM Na 
Phosphate [pH 8.0], 10 mM Na Pyrophosphate, 10 inM 
EDTA, 1 mg/ml bovine serum albumin) . Following a 
05 brief spin in a microcentrifuge, the supernatant was 
again removed using a 23 G needle and the sepharose 
was washed 5 times with NETN to remove any 
unincorporated label. Incorporation of label can be 
determined while the protein is attached to the bead. 

Elution With Reduced Glutathione 

For further studies, the fusion protein can be 
eluted from the beads. After the final wash the 
residual supernatant was aspirated using a 23 G 
needle and the labeled or unlabeled GST fusion 
15 protein was eluted by rocking the sepharose for 10-15 
minutes in 10-50 bead volumes of 20 mM reduced 
glutathione, 100 mM Tris [pH 8.0], 120 mM NaCl. 

3 2 

Effect of HMK Sequence on P-Incorporation In Vitro 
The ability of the Arg-Arg-Ala-Ser-Val sequence 

20 to convert the GST fusion proteins into substrates 
for the catalytic subunit of cAMP dependent protein 
kinase was tested. GST fusion proteins encoded by 
the PGEX-2TK recombinants, as well as the 
corresponding pGEX-2T constructs, were overexpressed 

25 in bacteria and recovered on glutathione sepharose as 
described above* As described in more detail, above, 
the purified fusion proteins, while still 
non-covalently bound to glutathione sepharose^ were 
incubated with ^^P-7-ATP and the purified catalytic 
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subunit of cAMP-dependent protein kinase for 30 min 
at; 4«*C, after which a kinase stop buffer was added. 
The sepharose, with bound protein , was extensively- 
washed to remove unincorporated labels transferred to 
OS clean eppendorf tubes,, and subjected to Cerenkov 
counting. 

The GST-RB fusion proteins were readily 

phosphorylated in vitro ^ provided that the kinase 

recognition sequence was present. Figure 2 shows the 

10 effect of inserting an HMK sequence on ^^P 

incorporation (cpm) in vitro data on four GST-RB 

fusion proteins. In particular, GST-RB (379-792) and 

GST-RBC379-792;pin706) fusion proteins which were 

encoded by pGEX-2TK and which contained an HMK 

i5 secjuence (hatched bars) , showed significant 

incorporation of "^^P. In contrast, GST-RB (379-792) 

and GST-RB(379-792;pm706) fusion proteins which were 

encoded by pGEX-2T and lacked an HMK secspience (solid 

bars) were not significantly radiolabeled. 

20 In another experiment, fusion proteins from 10 

ml of bacterial culture were recovered on 25 ^^1 of 

glutathione sepharose beads, and were subjected to 

phosphorylation using the same protocol. A 

GST-RB (379-928) fusion protein expressed from 

25 PGEX-2TK and having a kinase site was labeled with 

^^P (3 X 10^ cpm), while the GST-RB(379-928) fusion 

protein expressed from pGEX-2T and lacking a kinase 

site was not comparably phosphorylated (IX 10^ cpm) . 

Similarly, a GST-RB (379-928 ) ;pm706 fusion protein 

expressed from pGEX-2TK and having a kinase site was 

labeled with "^^P (3 X 10^ cpm) , while the 

GST-RB (3 79-9 28) ;pm706 fusion protein expressed from 

pGEX-2T and lacking a kinase site was not comparably 

4 

phosphorylated (1 X 10 cpm) . 
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Exainple 3 

Structural Integrity of Fusion Proteins 

Expression of Fusion Proteins in E, coli 

As discussed above, the GST-RB fusion proteins 
expressed from pGEX-2T-derived pla^mids 
pGT-RB (379-928) and pGT-RB (379-792) were able to bind 
to T antigen, ElA, and the putative cellular 
RB-binding proteins, while the mutant fusion protein 
encoded by pGT-RB(379-928 ;706F) did not (Kaelin, W,G. 
et_al.. Cell 64; 521-532 (1991)). This observation 
suggests that the binding function and structure of 
the RB proteins is not grossly disturbed when they 
are expressed as part of a fusion protein with GST. 

It was further determined that the insertion of 
the Arg-Arg-Ala-Ser-Val (RRASV) sequence did not 
grossly alter bacterial expression of GST-RB fusion 
proteins expressed from pGEX-2TK. For this 
determination, whole cell lysates of IPTG-induced 
cultures were prepared and total bacterial protein 
content was analyzed directly on SDS-polyacrylamide 
gels as described above. Protein from E. coli 
transformed with pGTK-RB (379-928 ) , pGTK-RB (379-792) , 
or pGTK-RB(379-928;pm706) were analyzed. The band 
intensity for each fusion protein was comparable to 
that for the corresponding construct lacking the 
phosphorylation site. Each fusion protein had an 
apparent molecular weight consistent with the size of 
the pRB fragment encoded by the RB cDNA insert and 
the 26 kD GST polypeptide. 
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Purif ica'tion of Fusion Proteins 

The subsequent purification of GST--RB fusion 
proteins expressed from pGEX--2TK was not altered by 
the insertion of the RRASV sequence. Bacterial 
05 sonicates of each of the transformants were also 

prepared, and proteins were subjected to glutathione 
sepharose chromatography as described above. The 
sepharose beads, with bound protein, were then washed 
three times with NETN, and the bound proteins were 
10 eluted by boiling in SDS sample buffer (2% SDS, 10 % 
glycerol, 62 mM l^is [pH 6.8]). The supernatant was 
subjected to electrophoresis on a 10% polyacrylamide 
gel and visualized by Coomassie blue staining. After 
purification, the intensity of the bands for 
15 GTK— fusion proteins expressed in E. coli transformed 
with pGTK-RB (379-928) , pGTO-RB (379-792) , or 
pGTK-RB (379-928 ;pm706) was comparable to that 
observed with for the corresponding constructs 
lacking the kinase site (i.e., pGT-RB (379-928) , 
20 pGT-RB (379-792 ) , or pGT-RB (379-928 ;pm706) , 
respectively) 

The following materials and methods were used in 
subsequent experiments. 

Cells and Culture Conditions 

The retinoblastoma cell line WERI-Rb27 (a gift 
of Dr» Wen-Hwa Lee) Huang, H.-J. et al . , Science 242 ; 
1563-1566 (1988)) and- 293 cells, a human embryonic 
cell line transformed by a fragment of the Adenovirus 
5 genome (Graham, F.L. et_al. , J. Gen. Virol . 36: 
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59-72 (1977)), were grown in Dulbecco's modified 
Eagle's medium (DMEM) with 10% fetal calf sertim 
(Gibco) . AJcata cells were grown in RPMI or DMEM, 
with 10% fetal calf serum (Gibco) . C57B1/6 primary 
05 mouse embryo fibroblasts (MEF) lines expressing 

either wild-type (MEF*Tex) or mutant (MEF*K1) forms 
of T antigen were grown in Dulbecco's modified 
Eagle's medium (DMEM) with 10% fetal calf serum 
(Gibco) and G418 (150 ug/ml) (Ewen, M.E. et__al. , Cell 
10 58: 257-267 (1989)). All cells were grown at 37»C in 

a humidified, 10% CO^-containing atmosphere. 

Radioisotopic labelling of cells and preparation of 

cell lysates was as described previously (Kaelin, 

W.G. et al . . Cell 64: 521-532 (1991)). 

15 Antibodies 

Tissue culture supernatants were the source of 
monoclonal antibodies PAb 419 and M73 (Harlow, E, et 
al. , J. Virol. 39: 861-869 (1981); Harlow E. et al ., 
J. Virol. 55: 533-546 (1985)). The use of these 
antibodies for immunoprecipitation and western 
blotting was described previously (Kaelin, W.G. et 
al.. Cell 64: 521-532 (1991)), except that electro- 
phoretic transfer of proteins to nitrocellulose was 
performed without the addition of methanol to the 
transfer buffer. 

Binding of Fu sion Proteins to T Antigen and EIA 

It was also determined that the insertion of the 
RRASV sequence did not demonstrably alter the binding 
behavior of GST-RB fusion proteins, GTK-RB (379-928) , 
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GTK-RB(379-792) , and GTK-RB (379-928 ;706F) , expressed 
from pGEX-2TK. The pRB binding assays for T antigen 
and ElA binding were performed essentially as 
described by Kaelin et al > (Kaelin et al > , Cell 64 ; 
521-532 (1991) ) . 

Fusion proteins GTK-RB(379-928) and 
GTK-RB (379-792) (bound to glutathione-sepharose) 
retained the ability to bind to SV40 T antigen or 
adenovirus ElA in solution* In contrast, no 
significant binding of T antigen or ElA to the mutant 
RB fusion, GTK-RB (379-928 ;706F) , or GST expressed 
from pGEX-2T{C was observed under the same conditions. 

Integrity of Phosphorylated Fusion Proteins 

To determine whether the kinase reaction led to 
a significant alteration in the RB T/ElA-binding 
region of the GTK-RB fusion proteins, the following 
experiment was performed* Whole cell lysates 
containing wild-type T antigen (MEF^Tex cells) , the 
RB-binding and transformation defective T antigen 
mutant Kl (MEF^^Kl cells) , or ElA (293 cells) were re- 
solved by SDS-polyacrylamide gel electrophoresis and 
transferred to nitrocellulose filters (See protocols 
in Example 4) * GST-RB fusion proteins were kinased 
in vitro and eluted from the glutathione sepharose in 
the presence of reduced glutathione as described 
above*. The eluted labeled protein was incubated 
overnight with nitrocellulose strips cut from the 
filters • The filters were then washed and subjected 
to autoradiography* Hybridization conditions and 
washes were as described below in Example 4, 
Hybridization of Filters* 
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Similar to the unphosphorylated version^ 
■^^P-GTK-RBf 379-792) bound to ElA and wild-type T from 
MEF*Tex cells, but not to the T mutant Kl from MEF*K1 
cells. The presence of equivalent amounts of T and 

05 Kl in this assay was confirmed by immunoblotting with 
a monoclonal antibody directed against T antigen. 

The binding of "^^P-GTK-RB (379-792) protein to 
ElA was inhibited by the presence of a synthetic 
peptide replica of the human papillomavirus E7 

10 RB-binding motif (wild-type E7 residues 16-32) . In 
contrast, a point mutant derivative of this peptide 
with a glu to gin change at residue 26, which is 
known to abrogate in vitro RB binding, was inert as 
an inhibitor. In addition, ^^P-GTK-RB (379-928) , but 

15 not ^^P-GTK-RB (379-928 ) ;706F) , exhibited ElA binding 
in this assay. Thus, it appeared that the structural 
integrity of the T/ElA-binding regions in the GST-RB 
chimeras was preserved during the kinase procedure 
and subsequent elution. 

20 The experiments described in this example 

indicate that the expression, behavior during 
purification and structural integrity of different 
polypeptides inserted into pGEX-2TK is not grossly 
affected by insertion of a kinase sequence or by 

25 subsequent phosphorylation. 
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Example 4 

Screenincr for Proteins Capable of Specific 
Interaction with the Protein of Interest and 
Isolation of Genes Encoding Such Proteins 

05 Library Manipulations 

For prianairy screening , librsories were plated at 
approximately 40,000 pfu/150 mm plate x 30 plates* 
Induction of ^-galactosidase fusion protein 
expression with IPTG impregnated nitrocellulose 

10 filters was performed as described by Singh et al . 

fBiotechniques 7r 252-261 (1989) )• Additional 

library manipulations including screening with 
32 

P-lcibelled cDNA probes, purification of specific 
clones, preparation of recombinant phage DNA, and 
15 sxibcloning of cDNAs into the sequencing vector pBKS 
(Stratagene) were performed using standard 
techniques « 

Preparation of Nitrocellulose Filters for 
Hybr idi zat ion 
^0 Electrophoretic transfer of proteins from 

SDS-polyacrylamide gels to nitrocellulose for wesftern 
blot analysis was carried out in 192 mM glycine, 25 
van Tris(Base), and 0.01% SDS. Plaque lifts were 
performed as described above • The nitrocellulose 
filters were placed, directly into IX HBB (25 mM 
Hepes-KOH[pH7.7], 25 mM NaCl^ 5mM MgCl2,l mM DTT) 
supplemented with 5% non-fat powdered milk and 0«05% 
NP-40 without drying and incubated overnight. This 
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and subsequent manipulations were performed at 4"C 
with gentle rocking. The filters were then denatured 
and renatured as described by Vinson et al . ( Genes 
and Dev . 2: 801-806 (1988)). Processing of multiple 
filters was done in batch. 

Following renaturation, the filters were placed 
in fresh IX HBB supplemented with 5% non-fat powdered 
milk and 0.05% NP-40, and incubated for l hour. The 
filters were then incubated in IX HBB supplemented 
with 1% non-fat powdered milk and 0.05% NP-40 for at 
least 30 minutes prior to hybridization. 

Hybridization of Filters 

To prepare the hybridization solution, protein 
escpression in E» coli (DH5a) transformed with 
PGEX-2TK and p6EX-2TK-derived plasmids were induced 
with IPTG as described above in Example 2. The 
bacteria were then pelleted and resuspended in 1/10 
the original culture volume in Hyb 75 (20 mM Hepes 
[pH 7.7], 75 mM KCl, 0.1 mM EDTA, 2.5 mM MgCl2, 1 mM 
DTT, 0.05% NP-40). The bacterial suspension was 
sonicated with a probe type sonicator (Branson) and 
centrifuged at 10,000 x G for 5 min at 4«»C. For 
western blots the clarified supernatant was used 
undiluted or diluted 1:2 with Hyb 75. For screening 
plaque lifts the clarified supernatant was diluted 
1:2-1:8 with Hyb 75 supplemented with 1% nonfat 
powdered milk. 

The relevant 32P-labeled GST fusion protein was 
added at 100,000-250,000 cpm/ml. Hybridization was 
carried out at 4«»c with gentle rocking overnight. 
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after which the filters were washed three times 
(10-15 min/wash) with Hyb 75 with 1% nonfat powdered 
milk. The filters were then dried, covered with 
Saran wrap and exposed to film at -70*^0 with an 
05 intensifying screen* 

Screening Precleared Lysates for Specifically 
Interacting Proteins 

Previously, a series of GST-RB fusion proteins, 
expressed from pGEX-2T vectors, and non-^covalently 

10 bound to glutathione sepharose, were used to search 
for cellular proteins capable of interacting, 
directly or indirectly^ with the pRB T/E1A/E7 -binding 
domain (Kaelin, W^G. et al >. Cell 64:521-532 (1991))- 
At least 7 cellular proteins were identified which 

15 bound to GST-RB fusion proteins having an intact 
T/ElA/E7-binding domain. These cellular proteins 
failed to bind to GST-RB fusion proteins (with no HMK 
sequence) derived from spontaneously occurring, loss 
of function RB mutations. In addition, the binding 
of these cellular proteins to GST-RB fusion proteins 
was blocked by a synthetic peptide corresponding to 
the pRB-binding /transforming sequence found in SV40 T 
antigen » In contrast, a point mutant derivative of 
this peptide, corresponding to the sequence found in 
the transformation defective T mutant Kl, did not 
inhibit binding. 

It appeared that one or more of these proteins 
may fulfill the criteria for a meaningful pRB 
cellular ligand. Unfortunately, which of the 

30 cellular proteins bound directly to the pRB T/E1A/E7- 
binding domain could not be determined from these 
experiments • 
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To demonstrate which of these cellular proteins 

is capable of interacting directly with RB, an 
35 

S-labeled WERI-Rb27 retinoblastoma cell lysate was 
prepared. The lysate was precleared by passage over 
glutathione sepharose which had been loaded with 
pGEX-2T-encoded GST to remove proteins which may bind 
to GST rather than the protein of interest. GT- 
RB( 379-792) fusion protein was loaded onto the 
glutathione sepharose beads, and the precleared 
lysate was incubated with the bound GT-RB( 3 79-792) 
fusion protein in the presence (for strip 1) or 
absence (for strips 2-4) of wild- type E7 peptide 
(residues 16-32) . Unbound protein was removed by 
washing and the proteins were eluted by boiling in 
15 sample buffer. 

Bound proteins (directly and indirectly bound to 
the RB-bound beads) were loaded in wide wells, 
resolved by SDS-polyacrylamide gel electrophoresis, 
and transferred to nitrocellulose. The filter was 
then cut into adjacent strips and either subjected to 
autoradiography (strips i and 2) or probed with 
^^P-GTK-Rfi(379-792) in the presence (strip 3) or 
absence (strip 4) of the above-mentioned E7 peptide. 
Strips 3 and 4 were then washed, dried, placed under 
^5 saran wrap (which greatly reduces the "^^s signal) , 
and placed under film. 

At least two bands appeared to be capable of 
interacting directly with the ^^P-GTK-RB (379-792) 
probe in a peptide inhibitable manner. These bands 
2° were not detected by "^^P-GTK-RB(379-792) on strips in 
which the lysate was incubated with GT-'RB( 379-792) in 
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the presence of wild-type E7 peptide prior to 
electrophoresis, indicating specificity of binding. 
Furthermore r the ability of these two cellular 
proteins to interact with ^^P-GTK-RB (379-792) in the 



05 filter binding assay was not inhibited by a point 
mutant (Glu^^ to Gln^g) 
defective in RB binding. 



mutant (Glu^^ to Gln^g) version of the E7 peptide 



Screening: Whole-cell Lysates for Specifically 

Interacting Proteins 

10 An unlabeled whole cell lysate prepared from 

WERI-rb27 cells was also probed for cellular proteins 

capable of binding directly to the RB. The unlabeled 

extract was subjected to SDS-polyacrylaunide gel 

electrophoresis and transferred to nitrocellulose for 

15 Western blotting ♦ The filter was cut into strips 

32 

which were probed with selected P-labeled fusion 
proteins. Again, there appeared to be at least 2 
cellular proteins capable of interacting with 

P-GTO-RB fusion proteins in which the T/ElA-binding 

20 region was intact ("^^P-GTK-RBC 379-792) and 
^^P-GTK-RB (379-928 )pm706) fusion proteins)- 

Furthermore, the binding of these cellular 
proteins to wild-type ^^P-GTK-RB(379-792) fusion 
protein could be selectively inhibited by three 

2^ different RB-binding peptides. Peptide replicas of 
the RB-binding sequences found in T antigen (amino 
acids 102-115) , ElA (tyrosine followed by ElA 
residues 115-132) , or E7 (amino acids 16-32) each 
inhibited binding of the labeled probe to the two 
cellular proteins* In contrast, point mutant 
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derivatives of these peptides failed to inhibit the 
interaction of the probe with the filter-bound 
cellular proteins. The particular mutant peptides 
tested included the T antigen peptide with a Glu^^^^ 
to Lys change, the ElA peptide with a Cys^25 ^° 
change, and the E7 peptide with a Glu^g to Gin 
change. Similar results were obtained when this 
assay was performed with a human Burkitt's lymphoma 
cell line (Akata) as well as with normal peripheral 
blood lymphocytes. 

Use of Lab eled GST-fusion Proteins as Probes for the 
Isolation of Genes 

In order to isolate cDNAs encoding cellular 
proteins capable of interacting specifically, and 
directly with RB, an Akata Agtii expression library 
was screened with the "^^P-GTK-RBC 3 79-792) probe as 
described in the protocols above. Six Agtll clones 
(clones 1, 3, 4, 5, 6 and 9) encoding /9-galactosidase 
fusion proteins which bound to ^^P-GTK-RB( 379-792) 
with high affinity were plaque purified and subjected 
to further analysis. 

The binding of the fusion proteins encoded by 5 
of these clones (clones l, 4, 5, 6 and 9) to the 

P-GTK-RB( 379-792) probe was markedly reduced or 
undetectable in the presence of the wild-type E7 
peptide, while the mutant E7 peptide had no effect on 
binding. Furthermore, the fusion proteins encoded by 
all of the clones bound readily to 
"^^P-GTK-RB ( 379-928 ) , whereas binding to 
30 2p_g^j^_j^^3^g_g28;706F) , a protein with a mutation 
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in the T/E7/E1A binding region of RB, was 
undetectable. Thus, binding to these clones 
displayed RB binding behavior similar to cellular 
proteins which interact with RB. 

Cross-hybridization experiments, and subsequent 
sequence analysis, demonstrated that 4 of the clones 
(Clones 1, 3, 4 and 6) contained overlapping cDNA 
fragments derived from a common mRNA. The gene 
encoding this mRNA will be referred to as RBAPl* 

Sequence analysis of clone 5 predicted that it 
encoded a ^-galactosidase leader polypeptide fused to 
the sequence His-Ser-Phe-Leu-Leu-Cys-Asp-Glu-Asn-Val- 
Leu-Asp-Stop. Thus, this fusion protein contains the 
Leu-X-Cys-X-Glu motif common to the viral RB-binding 
Jnotifs. Additional cDHA clones related to clone 5 
were obtained by screening a 293 cell library with 
the clone 5 insert cDNA. Sequence analysis of these 
clones suggested that the short open reading frame in 
clone 5 was generated by the juxtaposition of a 
normally untranslated cDNA segment with the Agtll 
^— galactosidase coding sequence. Thus, while the 
clone may not represent a cellular protein, upon 
expression in Agtll, it encodes a protein with 
properties (e.g- , a Leu-X-Cys-X-Glu motif) common 
with viral RB-binding motifs. 

Clone 9, unlike clone 5, contained a long open 
reading frame, consistent with the possibility that 
it encodes an RB-interacting protein. 
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Additional Characterization of RBAPl 

The retinoblastoma gene product is believed to 
serve as a cell cycle regulatory element* In 
particular, pRB is thought to contribute to the 
regulation of cell cycle progression as cells 
traverse the Gl/S boundary. Consistent with^ a 
possible role as an RB-interacting protein, RBAPl 
message levels appear to respond to cell cycle 
events . 

A Northern blot was prepared of total RNA 
obtained from resting peripheral blood lymphocytes 
(PBL) and from PBL at various time points following 
stimulation with a cocktail containing PMA, PHA, and 
a calcium ionophore. The Northern blot was probed 
with an RBAPl probe from clone 4. A single message 
of about 3.5 kb became detectable 24-36 hours after 
stimulation. 

In a similar experiment, PBL were stimulated to 
enter the cell division cycle in the presence of 
hydroxyurea (HU) . Again, a single 3.5 kb mRNA was 
detected with the RBAPl probe. The abundance of the 
message in RNA from HU- treated PBLs increased to a 
maximum at 36 hours, and then appeared to plateau. 
Furthermore, the level of the mRNA in cells detected 
by the RBAPl probe fell dramatically within 8 hours 
after removal of HU. 



In vitro Binding of RBAPl to pRB 

The RBAPl cDNA was cloned into pGEX-2T and 
expressed in E. coli as a GST-fusion protein. 
Glutathione sepharose beads were loaded with the 
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35 

GST-RBAPl fusion. S-labeled RB proteins were 
prepared by in vitro transcription and translation 
from cDNA clones as described (Kaelin, W.G. et_al • , 
Moi> Cell, Biol . 10(7) : 3761-3769 (1990)). The 
05 labeled RB proteins, RB(379-928) and 

RB (379-928 ;pm706) were incubated with the bound 
GST-RBAPl fusion protein. The beads were washed and 
analyzed for the amount of fusion protein retained. 
Quantitative recovery of the in vitro translated 
^^P-GTK-RB (379-928) protein, but not of the 
"^^P-GTK-RB (379-928 ;pm706) fusion protein was 
observed. This experiment further suggests that the 
RBAPl product is capable of interacting directly with 
the RB gene product. 

^5 Expression of pl07 as a GTK-fusion Protein 

The RB-related protein pl07 was also expressed 
as a GTK-fusion protein* A cDNA encoding a region of 
pl07 which is homologous to the T/E1A/E7 binding 
domain of the RB gene product was inserted into 
pGEX-2TK. Using the protocols described in the above 
examples, a GTK-pl07 fusion protein was expressed in 

coli r a lysate was prepared, the fusion protein 
present in the lysate was captured on a 



20 



glutathione-sepharqse affinity column, and labeled 
25 32 

^ with [ PI in situ in a phosphorylation reaction 

using the catalytic subunit of cAMP-dependent protein 
kinase and [r-^^PJATP. The ['^^P] -labeled GTK-pl07 
fusion protein was eluted from the column, and was 
used as probe in Western blot analysis of whole cell 

30 

extracts. Whole cell extracts (unlabeled) containing 
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ElA were prepared from 293 cells, subjected to 
SDS-polyacrylamide gel electrophoresis, and electro- 
phoretically transferred to filters. Filters were 
cut into strips and probed with (1) anti-ElA 

05 antibody, (2) I"^^P] -labeled GTK-pl07 fusion protein, 
(3) [-^^P] -labeled GTK-pl07 fusion protein in the 
presence of competing wild-type E7 peptide (residues 
16-32), and (4) (-^^P] -labeled GTK-pl07 fusion protein 
in the presence of mutant E7 peptide (residues 16-32, 
with a Glu to Gin mutation at residue 26) . 
Autoradiography indicated that the [P] -labeled 
GTK-pl07 fusion protein, alone or in the presence of 
mutant E7 peptide, was able to detect the ElA product 
on the blots. The specificity of interaction was 

15 indicated by the observation that the ['^^P] -labeled 
GTK-pl07 fusion protein did not detect the ElA 
product in the presence of competing wild-type E7 
peptide . 

Example 5 

^0 Construction of pAR(ARI) 59/60 and Properties of 

pFEK-fusion Proteins 

Construction of pjai(ARI) 59/60 

Plasroid pAR3040 (also referred to as pET3a) was 
the starting material (Studier, Meth. Enzymol . 185 : 
^5 60-89 (1990)). This plasmid and expression plasmids 
derived from this vector can be maintained and 
induced for expression as described by Studier 
(Studier, Meth. Enzymol .185 : 60-89 (1990)). Plasmid 
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PAR304O was cut with EcoRl. The overhangs were made 
blunt by "filling in" using Klenow enzyme and dNTPs, 
and religated to destroy the EcoRI site* In an 
infrequent event, the EcoRI site was regenerated on 

05 propagation of the plasmid. Therefore^ another 
version of of this intermediate was constructed by 
treating with mung bean nuclease to remove the EcoHI 
overhangs prior to religation. Regeneration of the 
EcoRI site in the latter version was not observed. 

10 Both versions of the intermediate construction were 
modified as described below, to make two slightly 
different pAR(ARI) 59-6a vectors (i.e.^ differing in 
the manner in which the EcoRI site was destroyed) • 
These vectors behave identically with respect to 

15 expression of encoded fusion proteins. The 

intermediate vectors were then cleaved with Ndel and 
treated with calf intestinal alkaline phosphatase 
(CIP) • 
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Two complementary oligonucleotide adaptors, MAB 
59 and MAB 60, were synthesized, kinased, and 
annealed. The adaptors were then ligated into the 
Ndel cleaved, CIP-treated vectors. The adaptors 
introduce the 17 amino acid sequence comprising the 
FLAG peptide and HMK site shown in Figure 2. The 
sequence of the adaptors is shown below: 

Met Asp Tyr Lys Asp Asp Asp Asp Lys Ala Arg Arg- 

5'-T ATG GAC TAG AAA GAC GAT GAC GAT AAA GCA AGA AGA- 
AC CTG ATG TTT CTG CTA CTG CTA TTT CGT TCT TCT- 

Ala Ser Val Gin Phe- 

GCA TCT GTG GAA TTC CA 

CGT AGA CAC CTT AAG GT AT-5 ' 

Plasmids having one insert of the double-stranded 
adaptor were identified by restriction analysis of 
miniprep DNA. An Xbal-BamHI double digest releases a 
fragment comprising the inserted sequence. The 
structure of resulting plasmids, named pAR(ARl) 59/60, 
was confirmed by dideoxy sequencing using 
oligonucleotide primers MAB 58 
(5'-GCAGCCAACTCAGCTTC-3') and MAB 69 
(5'-TTAATACGACTCACTAT-3 ' ) . 

Construction of Derivatives of pAR(ARI) 59/60 

Four derivatives of pAR(ARI) 59/60 were prepared. 
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The first derivative encoding a selected polypeptide 
was prepared by the insertion of a 1*5 kb EcoRI 
fragment of a shPan-1 (N3) cDNA into pAR(ARI) 59/60, 
which had been cleaved with EcoRI and treated with 

05 ciP. The shPan-1 (N3) cDNA encodes a DNA binding 
protein r as described by German et al > (German et 
al.^ Molec> Endocrinol . 5: 292-299 (1991)). 

Plasmids containing the insert were detected by 
restriction analysis of miniprep DNA using a 

^0 BamHI-StuI double digest. The correct orientation 
yielded a diagnostic 500 bp fragment* The sequence 
at the junctions between the vector and insert was 
verified by sequencing, using oligonucleotide primers 
MAB 58 and MAB 69 (see above) . The plasmid is 

^5 referred to herein as pFEK-shPan-1 . 

The second derivative of pAR(ARI) 59/60 was made 
by the insertion of N3-SH, a fragment of shPan-1 
generated by PGR (German et al * , Molec , Endocrinol . 
5r 292-299 (1991) ) . For cloning, the fragment was 
generated by FOR, using oligonucleotide primers MAB 
72 (5'-GGCCGAATTCTCCTGGTCCCACGGAGACCC-3') and MAB 73 
(5'-6GCCGAATTCGCCGAGGAGGACAAGAAGGACC-3') • The PGR 
products were purified on an agarose gel, 
electroeluted, treated with T4 DNA polymerase, euid 
digested with EcoRI. The resulting fragment was 
ligated to EcoRI-cut, CIP-treated pAR(ARI) 59/60. 
Constructs with single inserts were identified by 
Xbal-BamHI double digests of miniprep DNA. The 
junctions and sequence of the insert were verified by 

30 dideoxy sequencing using oligonucleotides MAB 58 and 
MAB 69 (see above) . The resulting construct is 
referred to herein as pFEK-N3-SH. 
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The third derivative of pAR(ARI) 59/60 was 
constructed by the insertion of a DNA sequence 
encoding amino acids 120 through 206 of the rat c-fos 
protein (Kouzarides, T. and E. Ziff , Nature 336 ; 
05 646-651 (1988); Curran, et al . , Oncogene 2: 79-84 

(1987) ; Nakabeppu, Y. and D. Nathans, EMBO J . 8: 
3833-3841 (1989)). The specific fragment used for 
cloning was generated by the polymerase chain 
reaction (PGR) , using oligonucletide primers MAB 70 
(5'-GGCCGAATTCGCGCAGAGCATCGGCAGAAG-3') and MAB 71 
(5'-GGCCGAATTCCTACTAGATCTTGCAGGCAGGTCGGT-3 ' ) . The 
resulting PGR products were purified on an agarose 
gel, electroeluted, treated with T4 DNA polymerase, 
and digested with EcoRI. The resulting fragment was 

^5 ligated into EcoRI-cut, GIF-treated pAR(ARI) 59/60 • 
Isolates with single inserts were identified by Xbal 
and BamHI double digests. The junctions and insert 
sequence were verified by dideoxy sequencing using 
the oligonucleotide primers MAB 58 and MAB69 (see 

20 

above) . The resulting plasmid is referred to herein 
as pFEK-c-fos(120-206) . 

The fourth derivative of pAR{ARI) 59/60 was 
derived by the insertion of amino acids encoding 
residues 206 through 340 of the human c-jun protein 

25 

(Kouzarides, T. and E. Ziff , Nature 336 ; 646-651 

(1988) ; Bohmann, D. et_al. , Science 238: 1386-1392 
(1987)). The fragment for cloning was generated by 
PGR using oligonucleotide primers MAB 74 
(5'-GGGGGAATTGTTTCGGGGGCAAGGGGAGGA-3') and MAB 75 

30 (5 ' -GGGGGAATTGCGGACGGTGTCTCTTCAAA-3 ' ) . The PGR 
products were purified on an agarose gel, 
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electroeluted, treated with T4 DNA polymerase, and 
digested with EcoRl. The resulting fragment was 
ligated to EcoRI-cut, CIP-treated pAR(ARI) 59/60. 

Constructs having a single insert were again 
identified by Xbal-BamHI digestion of mini-prep DNA. 
The Junctions and insert sequences were confirmed by 
dideoxy seuqencing using primers MAB 58 and MAB 69 
(see above) . The resulting plasmid is referred to 
herein as pFEK-c-jun (206-340) . 



10 



15 



20 



pFEK-fusion Proteins Retain Biological Activity and 
Are Efficiently Phosphorylated 

As each of the selected polypeptides expressed 
from pAR(ARI) 59/60 is a DNA binding protein, the 
activity of the selected polypeptide portions, in the 
context of a fusion protein comprising an affinity 
ligand and modification site, were assayed by 
electrophoretic mobility shift assays (EMSA) . Assay 
conditions for the FEK-shPAN-i fusion protein encoded 
by pFEK-shPan-1 and by pFEK-N3-SH, were as described 
by German et al . (German et al . , Molec. Endocrinol . 
5: 292-299 (1991)). EMSA conditions for the fusion 
protein encoded by pFEK-c-fos( 120-206) were as 
described (Kouzarides, T. and E. Ziff , Nature 336 ; 
646-651 (1988) ? Curran, T. et al . , Oncogene 2: 79-84 
25 (1987); Nakabeppu, Y. and D. Nathans, EMBO J . 8: 
3833-3841 (1989)). 

EMSA conditions for the fusion protein encoded 
by pFEK-c-jun (206-340) were as described (Kouzarides, 
T. and E. 2iff, Nature 336 ; 646-651 (1988); Bohmann, 
30 D. et al .. Science 238 ; 1386-1392 (1987)). The DNA 
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binding activity of the PEK-c-jun(206-340) fusion 
protein present in bacterial extracts, was assayed in 
combination with bacterial extracts containing fos 
"core" (to assay binding of jun as a jun-fos 

05 heterodimer) . 

The four unlabeled fusion proteins were made by 
in vitro transcription/ translation- The particular 
DNA probes used in each shift assay were radiolabeled 
for detection of DNA-fusion protein complexes. As a 

10 positive control, each of the native (i.e., not 

fused) DNA binding proteins was prepared by in vitro 
transcript ion/ translation and subjected to the shift 
assay. The short polypeptide produced by 
pAR(z^RI) 59/60, and the transcription/translation 

15 products from a vector control for each 

pFEK-construct in which the selected polypeptide 
insert was in a reverse orientation, provided 
negative controls for DNA binding. As indicated by 
the shift assay, all four pFEK-fusion proteins bound 

20 DNA to an extent comparable to the corresponding 
native protein. These results indicate that the 
incorporation of the selected polypeptides (fos, jun, 
shPan-l (N3) , and N3-SH) into fusion proteins of the 
present invention comprising a modification 
recognition site, did not alter the biological 
activity (DNA binding) of these proteins. 

In parallel to the experiment just described, in 
vitro transcribed and translated fusion proteins were 
modified by phosphorylation with HMK kinase and 

30 non-radioactive ATP (see below for conditions) . 
Modification by phosphorylation could be 
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distinguished because the negative charge of the 
additional phosphate group on the modified fusion 
protein led to an alteration in migration of the 
DNA-fusion protein complex as compared with the 
05 unmodified fusion protein-DNA complex. Modification 
of all four of the fusion proteins did not alter the 
extent of complex formation^ indicating that the 
biological activity of the selected polypeptide 
portion is not altered by modification. 

^0 Induction of Protein Expression in BL21 Bacteria 
Induction of expression from pAR(ARI) 59/ 60 
and pFEK- plasmids was done essentially as described 
by studier et al. (Studier, Meth. Enzvmol . 185 ; €0-89 
(1990)). Briefly, plasmids were transformed into 

i5 competent BL21 or BL21 pLysS bacteria. A single 
colony was inoculated into Luria broth with 
ampicillin at 50 ug/ml and with chloramphenicol at 25 
ug/ml. Growth at 37<»C was allowed to proceed until 
an optical density (A^^^) of approximately 1.0 was 
attained. IPTG was added as described by Studier 
( Meth. Enzvmol . 185 : 60-89 (1990)). Cells were grown 
for another 3 hours or so at 37**C. A bacterial 
lysate was prepared using standard techniques. 

Following expression in E. coli , the 
unfractionated bacterial extract from transformants 
carrying expression vector pFEK--shPan-l, pFEK-N3-SH, 
pFEK-c-fos (120-206) , or pFEK-c-jun (206-340) was 
assayed by electrophoretic mobility shift assay as in 
the previous section. The results indicated that the 
unmodified or modified (by phosphorylation with 
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unlabeled ATP) FEK-fusion proteins expressed in 
bacterial extracts retained the biological activity 
of the selected polypeptide portion. 

Partial Purification of Proteins Produced ; 

Following expression in E. coli , bacterial 
extracts from cells transformed with a pFEK- vector 
were fractionated to obtain partially purified 
FEK-fusion proteins by one of two methods. 

A) DEAE-Sephacel chromatography: 

-used as per directions of supplier 

(Pharmacia-PL) 
-buffer contained 100 mM NaCl and 10% glycerol 

(in addition to the buffer constituents 

presents in the bacterial lysates) 
-treated in "batch" for 60 minutes at +4^C 
-lysate was incubated with resin with 
continuous gentle agitation 
-supernatant solution passed over a 2 ml 

column of resin 
-flow-through collected 

The fusion proteins encoded by vectors 
pFEK-shPan-1 and pFEK-N3-SH were obtained using 
DEAE-Sephacel chromatography at -50% purity. 

B) Heparin-Sepharose chromatography: 
-used as per directions of supplier 

( Pharmac i a -PL ) 
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-DEAE flow-through fraction from protocol (A) 
was loaded onto a 1 ml bed volume column of 
resin 

-resin washed with 10 column volumes of load 
05 buffer (i*e. , 100 mM salt) 

-proteins were eluted with 1 column volume each 
of 200, 300, 400, 500, 600, 700, 800, 900 and 
1000 mH salt 
—fractions were tested for the presence of 
10 active protein by the appropriate assay 

For example, in the case of a fusion protein 
comprising the fos '^core'^ (encoded by 
pFEK-c-fos(120-206J ) , activity of fractions was 
assayed by electrophoretic mobility shift assay 
15 together with reticulocyte-produced c-jun protein. 
Peak activity for the fusion protein comprising the 
fos 'core' fusion protein was eluted at approximately 
500-600 mM salt yielding a preparation with -50% 
purity • 

20 Phosphorylation of Proteins with HMK r 

All four fusion proteins produced by the pFEK- 
vectors (pFEK-shPan-1 , pFEK-N3-SH, 

pFEK-c-fos (120-206) , and pFEK-c-jun (2 06-340) ) were 
capal:>le of being modified by phosphorylation when 
25 present in crude bacterial extracts. As discussed 

above, modification by phosphorylation with unlabeled 
ATP was observed. In addition, modification using 
radioactively- labeled ATP was observed. 
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Proteins which were partially purified by the 
methods (A and B) described above were also 
efficiently radiolabeled by phosphorylation using HMK 
enzyme and [7-^^P]ATP, Labeling to a specific 
05 activity of approximately 10^ to 10® cpm per fig of 
protein was observed for partially purified 
FEK-fusion proteins encoded by pFEK-shPan-l, 
PFEK-N3-SH, and pFEK-c-fos (120-206) . The protocols 
used for protein labeling are described below. 

10 Phosphorylation with Heart Muscle Kinase : 

Sigma # P-2645 Protein Kinase, Catalytic subunit 
from bovine heart was obtained as lyophilized powder 
in 250 unit vials. Typically, 250 units of HMK 
enzyme was resuspended in 25 /zl of 40 mM DTT (i.e., 

15 at 10 u/m1) • The solution was allowed to stand at 
room temperative for 10 minutes and was stored at +4 
**C. Activity was stable for 2-3 days, but HMK was 
usually freshly reconstituted for each use. 

A lOx preparation of HMK Buffer was prepared 

20 (200 mM Tris-Cl (pH 7.5), 10 mM DTT, 1 M NaCl, and 
120 mM MgCl^) . The phosphorylation reaction mixture 
was as follows: 

3 /il (lOx) HMK buffer 

2-5 Ml of [7^^P] (e.g., NEN-Dupont #NEG-035C 
25 [7^^P]^ATP >7000Ci/mmol) 

1-10 ul of protein extract (amount will vary with the 

particular protein) 
1 fil of 10 u/m1 HMK 
water to a total of 30 ^1 
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The reaction mixture was incubated at 37 "C for 30-60 
minutes, and was stored on ice until use. Some 
proteins were labeled almost as well when the 
incubation was carried out on ice. The low 
temperature incubation can serve to preserve the 
biological activity of the modified fusion protein. 
These procedures can be scaled up as required. 

G50 column c hromatography to Remove Unincorporated 
Label 

FEK-fusion proteins were used for Western 
blotting or for plague screening. For these 
applications, the fusion protein was run a G50 column 
as described below in order to remove unincorporated 
label following the HMK phosphorylation reaction. 

15 "Z + 0.1 M KCl" buffer (25 mM Hepes-KOH ([pH 

7.7], 12.5 mM MgCl^, 20% glycerol, 100 mM KCl) was 
prepared. For solutions comprising Z + KCl, and BSA, 
the solution was filtered through a 0.2 fin filter 
prior to use. DTT was added to a final concentration 

20 of 1 mM just prior to use. 

G50 fine (pre-swollen in water) was washed (IX) 

with 5 ml of (Z + 0.1 M KCl + 3 mg/ml BSA + i aM 
DTT) and resuspended at [i:i] in the sane buffer 
-the beads were rotated at RT for -i hour, and washed 
25 three times with (Z + 0.1 M KCl + i mg/ml BSA + 

1 mM DTT) 

-a 1 ml sterile plastic pipette was packed with the 
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washed GSO (bed vol -1.2 ml) and equilibrated at 

room temperature with -50-10 ml of (Z + 0.1 M 

KCl + 1 mg/ml BSA + i mM DTT) 
-immediately prior to loading the column, the 

total volume of the HMK reaction was brought to 

100 fil with ice-cold (Z + o.i M KCl + i mM DTT) 
-the column was loaded and run at room temperature, 

taking i drop fractions (-45 ^I/drop) 
-fractions were stored immediately on ice 
-2-5 nl aliquots were removed from each fraction and 

counted by Cerenkov counting 
-the excluded peak fractions (usually at 1/3 to 1/2 
the column volume) were pooled 

optionally aliquots of the fractions (e.g., i.o m1) 
can be fractionated on SDS-polyacrylamide gels to 
monitor the progress of the fusion protein. The 
above procedure can also be carried out at 4» C where 
desired. 



Equivalents 

Those skilled in the art will be able to 
recognize, or be able to ascertain, using no more 
than routine experimentation, many equivalents to the 
specific embodiments of the invention described 
herein. Such equivalents are intended to be 
encompassed by the following claims. 
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An expression vector comprising a nucleotide 
sequence which encodes at least one affinity 
ligand, at least one modification recognition 
sequence and at least one restriction site for 
the in frame insertion of a nucleotide sequence 
encoding a selected polypeptide. 

The expression vector of Claim 1 wherein the 
affinity ligand is a glutathione-S-transferase 
affinity ligand or a FI*AG epitope affinity 
ligand r and the modification recognition 
sequence is a phosphorylation site. 

The expression vector of Claim 1 fiirther 
comprising a nucleotide sequence which encodes a 
cleavable linker located between the affinity 
ligand and the restriction site for insertion of 
a selected polypeptide. 

The expression vector of Claim 1, wherein the 
affinity ligand is a glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, the modification recognition sequence is 
a phosphorylation site, and the cleavable linker 
is selected from the group consisting of a 
thrombin cleavage site. Factor Xa cleavage site, 
and enter okinase cleavage site* 
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The expression vector of Claim 4, wherein the 
vector is pGEX-2TK. 

The expression vector of claim 4, wherein the 
vector is pAR(ARI) 59/60. 

An expression vector comprising a nucleotide 
sequence which encodes a fusion protein 
comprising an affinity ligand, a modification 
recognition sequence and a selected polypeptide! 

The expression vector of Claim 7, wherein the 
affinity ligand is a glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, and the modification recognition 
sequence is a phosphorylation site. 

The expression vector of Claim 7 wherein the 
nucleotide sequence further encodes a cleavable 
linker located between the affinity ligand and 
the selected polypeptide* 

The expression vector of Claim 9, wherein the 
affinity ligand is glutathione-S-transf erase 
affinity ligand or a FLAG epitope affinity 
ligand, the modification recognition sequence is 
a phosphorylation site, and the cleavable linker 
is selected from the group consisting of a 
thrombin cleavage site, factor Xa cleavage site 
and enterokinase cleavage site. 
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11. A method of producing a labeled fusion protein 
comprising: 

a) expressing a fusion protein comprising an 
affinity ligand, a phosphorylation 

05 recognition sequence and a polypeptide in a 

host cell using an expression vector; 

b) lysing the host cells to obtain a lysate 
containing the fusion protein; 

c) contacting the lysate vith an affinity 

10 matrix under conditions sufficient to bind 

the affinity ligand portion of the fusion 
protein to the affinity matrix; 

d) washing the product of step (c) to remove 
unbound material; and 

15 e) contacting the bound fusion protein with 

[7--labeled}ATP and a kinase reaction buffer 
comprising a protein kinase under 
conditions appropriate for phosphorylation 
of the bound fusion protein to thereby 

20 produce a labeled fusion protein which is 

bound to the affinity matrix. 

12. The method of Claim 11 further comprising the 
steps of: 

f ) optionally stopping the reaction of step 
25 (e> by addition of a stop buffer; 

g) washing the affinity matrix with 
labeled fusion protein bound thereto to 
remove unincorporated label; and 



BNSDOCID: <WO ^93031 57A1_L> 



SUBSTITUTE SHEPT 



wo 93/03157 



PCr/US92/0(M87 



-79- 



10 



h) eluting the labeled fusion protein 

with a suitable elution buffer comprising a 
release component thereby obtaining a 
labeled fusion protein. 

13. The method of Claim 11 further comprising the 
steps of: 

f ) optionally stopping the reaction of step 
(e) by addition of a stop 
buffer; 

g) washing the bound labeled fusion protein to 
remove unincorporated label; and 

h) cleaving the labeled fusion protein with a 
site specific protease to produce a labeled 
selected polypeptide portion. 

15 14. A method of producing a labeled fusion protein 
comprising: 

a) expressing a fusion protein comprising a 
GST affinity ligand, a phosphorylation 
recognition sequence and a polypeptide in a 
host cell using an expression vector; 

b) lysing the host cells to obtain a 
lysate containing the fusion protein; 

c) contacting the lysate with immobilized 
glutathione under conditions sufficient to 
bind the GST portion of the fusion protein 
to bind to glutathione, thereby obtaining 
immobilized glutathione with the bound 
fusion protein; 
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d) washing the iimnobilized glutathione with 
bound fusion protein to remove unbound 
material ; and 

e) contacting the bound fusion protein with 
[-f-labeled]ATP and a kinase reaction buffer 
comprising a protein kinase under 
conditions appropriate for phosphorylation 
of the bound fusion protein to thereby 
produce immobilized glutathione with 
labeled fusion protein which is bound to 
immobilized glutathione. 

15 • The method of Claim 14 wherein the protein 
kinase is the catalytic subunit of a 
cAHP-dependent protein kinase. 

16. The method of Claim 14 wherein the [7 -labeled] ATP 
is [-r-*^^P]ATP. 

17- The method of Claim 14 further comprising the 
steps of: 

f ) optionally stopping the reaction of step 
(e) by addition of a stop buffer; 

g) washing the immobilized glutathione with 
bound labeled fusion protein to remove 
xinincorporated label; and 

h) eluting the bound labeled fusion 
protein with a solution comprising reduced 
glutathione thereby obtaining a labeled 
fusion protein. 
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18. The method of Claim 14 further comprising the 
steps of: 

f) optionally stopping the reaction of step 
(e) by addition of a stop buffer; 

g) washing the immobilized glutathione with 
bound labeled fusion protein to remove 
unincorporated label; and 

h) cleaving the labeled fusion protein with 
thrombin to produce a labeled selected 

^° polypeptide portion. 

19. A kit for preparing a radiolabeled 

glutathione-S-transferase (GST) fusion protein 
comprising: 

a) an immobilized glutathione affinity matrix; 

b) a wash buffer which permits binding of a 
GST-fusion protein to the affinity matrix; 

c) a kinase reaction buffer; 

d) a protein kinase preparation; 

e) an elution buffer comprising reduced 
glutathione for releasing the fusion 
protein from the affinity matrix; and 
optionally 

f) a reaction stop buffer. 



15 



20 



20. 

25 



A labeled fusion protein comprising an affinity 
ligand, a modification recognition sequence, and 
a polypeptide. 
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21. The labeled fusion protein of Claim 20, wherein 
the affinity ligand is a 

glutathione-S-transferase affinity ligand and 
the modification recognition sequence is a 
phosphorylation recognition sequence. 

22. The labeled fusion protein of Claim 20, wherein 
the affinity ligand is the FLAG epitope, and the 
modification recognition sequence is a 
phosphorylation recognition sequence. 

23. A host cell containing the expression vector of 
Claim 1. 
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N- Met- FLAG - HMK -Glu-Phe- 



Protein 



Ente rokinose 

FLAG peptide 



■^Sp Met Asp Tyr Lys Asp Asp Asp Asp Lys A/a 

. . AAGGAG ATATA CATATG GAG TAG AAA GAG GAT GAG GAT AAA GC A 

Ndel 

HMK recognition 

Arg Arg Ala Ser Vat Glu Phe 
AGA AG A GCA TOT GTG GAA TTC CATATG 

EcoKl Ndel 

. ^ 
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